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On the Independence Theory of 
Equalizer Convergence 


By J. E. MAZO 


(Manuscript received December 18, 1978) 


High-speed pulse amplitude modulated (PAM) data transmission 
over telephone channels is only possible when adaptive equalization 
is used to mitigate the linear distortion found on the (initially un- 
known) channel. At the beginning of the equalization procedure, the 
tap weights are adjusted to minimize the intersymbol interference 
between pulses. The “stochastic gradient” algorithm is an iterative 
procedure commonly used for setting the coefficients in these and 
other adaptive filters, but a proper understanding of the convergence 
has never been obtained. It has been common analytical practice to 
invoke an assumption stating that a certain sequence of random 
vectors which direct the “hunting” of the equalizer are statistically 
independent. Everyone acknowledges this assumption to be far from 
true, just as everyone agrees that the final predictions made using it 
are in excellent agreement with experiments and simulations. We 
take the resolution of this question as our main problem. When one 
begins to analyze the performance of the algorithm, one sees that the 
average mean-square error after the nth iteration requires knowing, 
as an intermediate step, the mathematical expectation of the product 
of a sequence of statistically dependent matrices. We transform the 
latter problem to a space of sufficiently high dimension where the 
required average may be obtained from a canonical equation Vn+1 
= VM(a)VY, + #. Here X(a) is a square matrix, depending on the 
“step-size”’ a of the original algorithm, and V, and ¥ are vectors. 
The mean-square error is calculable from the solution V ,,. 

Information about the solution of our equation is obtained by doing 
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matrix perturbation theory on &(a) for small values of a. We show 
that the first two terms of the perturbation solution contain, among 
their terms, the terms of the independence theory. Since the parameter 
a needs to be small even for independence theory to converge, agree- 
ment with an exact theory and experiment is obtained if, in some 
sense, the additional terms which appear in the perturbation solution 
may be disregarded. This will usually be the case. 


|. INTRODUCTION 


Adaptive equalization of telephone channels in order to facilitate 
high-speed data transmission has been successful ever since its intro- 
duction by Lucky in the 1960s. This technique uses a linear filter 
(configured as a tapped delay line) to remove the harmful effects of 
the linear channel distortion. At the start of the equalization procedure, 
a set of parameters, the tap weights, are adjusted so that the final 
setting of these taps minimizes the intersymbol interference between 
pulses in the data train. Many theoretical studies have been made 
concerning steady-state equalization after the optimum tap weights 
have been achieved; little analysis has been done concerning the 
convergence of the equalizer tap weights to their final settings. Even 
in the best published study on this problem (Ungerboeck, Ref. 1), it is 
necessary to invoke an assumption stating that a sequence of random 
vectors which direct the operation of the equalizer are statistically 
independent.{ This independence assumption will be explained more 
fully later; for the moment, we only indicate that it is not even 
approximately true. In fact, given the nth vector of the sequence, all 
but one component of the next vector will be exactly known. Yet if 
this assumption is made, surprising agreement with actual performance 
is obtained.’ Clearly, because of its importance, this situation begs for 
clarification. Hopefully, what we learn in equalization can be used for 
other applications where similar adaptive algorithms are used. In 
particular, the areas of linear prediction and adaptive array processing, 
both electromagnetic and sonar, come to mind. We concentrate our 
presentation on equalization, however, for here the author is sure of 
the details. 

We shall take as our performance criterion the expected value of the 
mean-square distortion, although the average error vector is also 
considered as a simpler problem. In particular, then, we are not 
concerned with the fluctuations which might occur in actual use. 


{ We are here concerned with convergence in random data, not with a known specially 
designed sequence. In usual startup operation, the data symbols are also assumed 
known, either by using a known sequence or by assuming that sufficiently accurate 
estimates are available. 
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Typically, the sample paths are close to the mean (see Ref. 1). na 
nutshell, our contribution to this problem consists of two parts. We 
first establish a time-independent difference equation which governs 
the average in question. This step is accomplished in a space of much 
higher dimension than one would initially assume. Second, examining 
the solution of this equation in a perturbation sense (the small “step- 
size” of the algorithm being the essential perturbation parameter), we 
find the leading terms contain the independence theory solution. 

Before delving into the abstract problem, we devote Section II to 
describing some more conventional aspects of data transmission and 
equalization and Section III to discussing the behavior of the mean- 
square error if the independence assumption is made. 


ll. DATA TRANSMISSION AND EQUALIZATION 


For our own convenience, we confine the discussion to binary 
baseband transmission and neglect the effects of additive noise. 

The equalizer, and in fact the entire detection procedure, operates 
on the samples of the baseband received signal r(¢), where 


r(t) = »» Am+kh(t — mT). 


If 1/T’ is the sampling rate, 1/T the symbol rate, a, the data symbols 
(iid, + 1 with equal probability) and A(t) the overall system impulse 
response, then these samples aret 


r(nT’) = Yo an+xh(nT’ — mT) n=0,1,2---. (1) 
For a synchronous equalizer, T’ = T and for a fractionally spaced 
equalizer, typically T’ = 7/2. If the coefficients of the equalizer are 
denoted by c;,t= 1, --+ , N (c; being also the zth component of a vector 
c) and the sequence of output samples of the equalizer are y,, then 
N 
yn = ¥ esr[(s — 1)T’ + nT] n=0,1,2,-+--. (2) 
s=1 
We call attention to the fact that, even when 7” # T, the equalizer 
samples are only of interest at multiples of the signaling interval T, 
and the notation of (2) takes this into account. We define a sequence 
(in time) of vectors X” such that the sth component of vector X’ is 


s= 1522, eee ,N 
XM =r[(s—1)T’ + nT] (3) 
n=0,1,2,---, 


+ We call the bit which “goes with” the mth pulse a,,+x (instead of the usual a,,) for 
later convenience. 
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and thus 
Yn = 0-X, (4) 


The implementation of (2) to (4) is shown in Fig. 1. 

Later, when we consider an adaptive equalizer, the taps will vary 
with time and ce” will be used for the sequence of tap-weight vectors. 
Ideally we would like (at least when n is large enough) the sequence of 
equalizer outputs to be the sequence of data symbols, except, perhaps, 
for a shift. For a finite equalizer (i.e., N finite) this ideal is not 
achievable, and instead the available taps are adjusted to minimize 
the average square error Ee?, where 

€n = Yn — An+K (5) 


and FE denotes the mathematical expectation with respect to the data 
symbols {a,}. If one introduces the N xX N channel autocorrelation 
matrix} (which is positive definite), 


A= EXO XO (6) 
and the vector, 
v = EFansxX”, (7) 


both of which do not depend on the time index n, then, for fixed taps 
c, the mean-squared error @ is given by 


& = E(yn — Qntx)” = C7’ Ac — 2e7v + 1. (8) 


Equation (8) shows @ to be a convex quadratic function of c. Any 
optimum choice of ¢c, say, c*, satisfies 


Ac* =v (9) 


which has a unique solution if A~' exists. We denote the minimum of 
& by &*. 

It will make little difference physically, and it will be a great 
convenience mathematically, if we pretend that the impulse response 
A(t) used in (1) has finite duration. Thus, assume 


h(t)=0 if |t|> AT. 

Let N, and N>2 be the largest integers such that 
NiT = HT (10a) 
(N —1)T’ — N.T = — AT. (10b) 


Further, choose the integer K in (1) to be N; and set M = N, + No + 
1, and let a” be an M-dimensional vector whose ith component is 


+ The superscript T always denotes transpose. 
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a, (OUTPUT) 





Fig. 1—Adaptive transversal equalizer, N = 5. 


al” = dn4i-1, 1 = 1, --- , M. Then using (3) and (1) we have 
X” = Ba”, (11) 
where in (11) B is an N X M matrix having elements 
1<i=N, 


By = h{((i — 1)T’ + (M+ 1-7)T), (12) 
l<j=sM. 


It follows from (10b) that M = Nif T’ = T and M>(N + 1)/2 if T” 
= T/2. 

The structure of the matrix B is illustrated below for the special 
case T’ = T, N= 3, M =7. 


Ap ky ho Ay he O 0 
B = 0 he hy ho h-y h_» 0 . 
0 0 he h, ho h_, h-2 


This structure means that X” has the same shifting property as a’. 


Thus, for example, in time sequence, 


-E-E 
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Since 


Ea™ a”? = T, (13) 
it follows from (6), (11), and (13) that 
A= EX™X™7T = BB’. (14) 
For the special case T’ = T, h(nT) = 6,0, then M = N, A = I, and 
An 
XM=a”=] | (15) 
Qn+N-1 


We now begin to describe the stochastic gradient algorithm used for 
equalizer convergence. But first we describe a different problem, the 
deterministic gradient algorithm, which is a method for finding the 
minimum on the surface .”, where 


S =cAc — 2c"v + 1. (16) 


This provides some heuristics for writing down the stochastic algo- 
rithm, but should not be confused with it. We take pains to point out 
some differences as we proceed, since many people substitute discus- 
sion of this algorithm for the actual one. 

Taking the gradient of (16) gives 


VS = Ac — v]. (17) 


Hence, if we were searching for a minimum of the function (16) by 
taking steps in the gradient direction, we would write the following 
equation for our position ec” at the nth stage 


et) = o™ — A(Ae™ — v), (18) 


A being a step-size parameter. Equation (18) coupled with (6) and (7) 
motivates the actual stochastic gradient algorithm used, namely, 


et) = ec ra af XX 7e™) _ Qn+Kx”] (19) 
= ce” a aenX”, (20) 


e, being the scalar error (5), and a the step-sizet. Thus in N-dimen- 
sional tap space we move in directions X”, where X” is [see (4)] the 
vector of values stored in the equalizer at time nT. Clearly, the allowed 


+ It is, of course, meaningless to speak of the “size” of a unless one fixes the size or 
scaling of the terms which multiply it in (20). We shall take the scaling of the latter so 
that, in the binary case, the matrix A [see (6)] has largest eigenvalue unity. 
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set of directions along which we “step” is, as (15) will testify, quite 
random and cannot be thought of as being gradient directions. Nev- 
ertheless, tradition dominates, and (19) and (20) are still referred to as 
a stochastic gradient algorithm. 

For our purposes, (19) may be rewritten slightly by introducing the 
error vector 


é” = ec” — ¢*, (21) 
Subtracting c* from both sides of (19) allows us to write 
et) = =(I- ax! XT) e& = a(e* TY (”) — On+K) xX. (22) 


Note the quantity c*7X — an+x is the instantaneous error if the 
optimum taps were used. This is normally quite small and would be 
zero if perfect equalization were possible. 

In terms of the e'”’, the mean-square error is 


G" = FF + TMQ” = St + OD. (23) 


In (23) the symbol £%? has been introduced for the excess mean-square 
error over é*. 

In (22) and (23), e” is random, and in fact depends on the entire 
sequence of data it since the adaptation began. Our measure of 
the progress of the algorithm will be Ef”, the average of the error at 
time n over all data sequences. 


il. THE INDEPENDENCE THEORY 


In this section we describe “independence theory,” an approximation 
used to mathematically treat the stochastic gradient algorithm de- 
scribed by (22). Use of the approximation allows one (as we shall see) 
to determine bounds on the step-size a which will ensure stability and 
allows calculations to be made on convergence rates. 

Independence theory treats the stochastic algorithm by assuming 
that the sequence X™ are statistically independent vectors. Since, 
from (22), e’” depends only on the sequence X"”, --- , X‘"~” (assuming 
we start with X"), we conclude e” and X™ are independent. For an 
example as to how this is applied, we look at the average error vector 
Ee. We have, from (22), (6), (7), and (9), 


Ee (n+1) = ([- aA)Ke™”. (24) 

If, for comparison, we introduce the error vector c, — c* for the 

deterministic theory and call it d” so no confusion can arise, we would 
have, subtracting e* from both sides of (18), 

qt) = (I- AA)d™, (25) 
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There is no question of an average in (25); d” is the error. In (24), 
Ee can be zero although the norm of e” can be quite large. 

To emphasize the difference further, let us return to the simple 
model (15) which describes an undistorted channel, for which perfect 
equalization is possible. Only the initial setting of the taps is wrong. 
For this case, we have [note A = J in (23)] using (22) and the 
independence assumption 


Fe )Tet) = EMT T a ax"K 7) (I = aX MK MT) 


= (1 — 2a + a’® Nee, (26) 
Thus the error decays to zero as 
(1 — 2a + a®N)"F, (27) 
which is optimized if a = 1/N to give . 
(: - x). (28) 


Note how convergence is slowed as the number of taps N of the 
problem increases. By contrast, if A = J in (25), choosing A = 1 gives 
convergence in one step, independent of dimension. 

The convergence range of (24) for A = I is 0 < a < 2, while for (27) 
it is 0 <a < 2/N. In practice, N ranges from about 7 to 64 and thus a 
is, by the requirement of convergence of the mean-square error, kept 
quite small. 

In order to examine independence theory further, it will be conven- 
ient to discuss the (positive definite) error matrix 


R” = BFeMeT. (29) 


All the information we wish about E¥®, the average excess mean- 
square error, is contained in (29). Thus, from (23) 
EE) = Ee TA = ¥ (a)ij(Eee)j; 
ij 
= tr AR”. (30) 
Similarly, the average norm E ||e” ||? = tr R™. 

Our procedure for writing an equation for the time evolution of R” 
is simply to write the definition of R“*” using (29), substitute (22) for 
e"t) and do the average using the independence assumption. Various 
cross terms arise, and the computations naturally fall into three steps: 


Step 1: 
El — aX? XT JeMEMTMT — oXOKMT] 
aa R” = af AR” + RA] a VEX MX MTR MK MX OMT, (31) 
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Appendix A discusses the evaluation of the last term. For simplicity, 
we approximate the exact evaluation by «7A tr AR”. When A = IJ, we 
have tr[a’A tr AR” ] = a°N&, so in general this term plays the role 
of the a?N term in (26). 

Step 2: 


Eo{I — oX?X 7] eX (C#TX™ — anax). (32) 


This is considered further in Appendix A and, for reasons given there, 
is approximated by zero. 
Step 3: As discussed in Appendix A, 


Ea’ (e*?7X™ J An+k) XX” 7 (e* TX () £= Qn+K) ~ art. (33) 


Putting together these three steps, we have the following accurate 
approximation from independence theory: 


R™*) = R® — of AR + RA] + 07A tr AR” 4+ a’S*A. - (34) 


Note that the last term prevents R™ = 0 from being a solution. Thus, 
R™ is prevented from going to zero by the small forcing term. Thus, 
in particular, e” only approaches zero but then executes small fluc- 
tuations about zero. 

Since (34) is an approximation, we prove in Appendix B that the 
positive definite character of R” is preserved in (34). 

We now introduce a more useful form of (34) when the mean-square 
error is of primary interest. Since A is hermitian, let U be the unitary 
transformation which diagonalizes A, 


U*tAU = D, (35) 
where we call the elements of the diagonal matrix D, by d;. Further, let 
UtR™U = T™. (36) 


In general, T is not diagonal, but set T{? = t{”. Further, note 
N 
o = tr AR” = tr DT™ = J: dit!”. (37) 
i=1 


It follows from (34), (35), and (36) that 
TT?) = T — a[DT + TD] + oD tr DT + a? F*D. (38) 


Noting from (37) that the mean-square error depends only on the 
t\"), we are motivated to look at the diagonal terms of (38). Happily, 
they decouple from the off-diagonal terms and we have 


tern = ti”) _ Qad;t!” “an ad; ») d;t}” ae a’F* d,. (39) 
j=l 
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If we introduce vectors t™ and d in the obvious way, (39) itself can be 
rewritten in matrix notation as 


tr) = Mt” + a’S*d, (40) 
where the N x N matrix M has elements 
Mi; =(1- 2ad;)6i; + a’did;. (41) 


From (41) we note M is real and symmetric and thus has real eigen- 
values. 

The solutions to (40) will be stable if and only if the matrix M has 
all eigenvalues A; such that —1 = A; S 1. Let g be an eigenvector of M 
with eigenvalue A. Then 


Mg = Ag (42) 
reads 
&i ~ 2adigi + a°*(¥) dig;)di = dei 
j 
or 
2 dj 
&i = —a (> ce ae (43) 


g: denoting the components of g. 

In (41) we see that, whenever d; = 0, there is aA = 1 for all a. The 
eigenvector has g; = 1 and g; = 0, 7 ¥ i. These eigenvalues do not 
change with a and are not of interest here. Set d; = d; if d; ¥ 0. Then 
we are concerned with 


M; =(1l-— 2ad;)8i; + add; (44) 


in a space of appropriately reduced dimension N. For a small enough, 
the eigenvalues are approximately 1 — 2ad;< 1 (a > 0, of course). Now 
increase a until possibly one of the eigenvalues becomes +1. What is 
the critical value of a? Since all elements of (44) are strictly positive 
(except at most N values of a), the magnitude of the largest eigenvalue 
may be taken to be associated with a positive eigenvalue.’ Thus, in 
(43) [reinterpreted to match (44)], set A = 1, multiply by d;, and sum 
on l. We then obtain 


2 2 
Qerit = 0, = va: (45) 
Thus, independence theory predicts a stable algorithm if 
0<a<=——= Ea (46) 
Xd Nd 


d being the average eigenvalue of the channel correlation matrix A. 
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The excess error &{ after adaptation may be derived from (39) 
using (37). We get 


d: 
© = gge 2G 47 
a Yd; wy 

The above discussion should provide the reader with an idea of what 
we hope to justify and why. The independence assumption, if it leads 
to valid results, provides a very workable theory for gaining insights 
about, and doing calculations on, the convergence procedure. 


IV. AN EXACT DESCRIPTION 


In this section, we put forth an exact description of how, in principle, 
the average mean-square error may be obtained. We begin, however, 
with the average error vector Ee”, a simpler quantity, but one which 
requires essentially the same treatment. The exact dynamics of e” is 
given in (22), and the independence theory for Ee” is given by (24). 

For simplicity, we rename the terms in (22) 


(I — aX XT) = Pie” (48) 
and 
—a(e' 7X” — anyx)X” = £, (49) 
so (22) reads 
ED = Pie 4 fe, (50) 


which, by iteration starting with a fixed error vector e, has the 
solution 


n n-1 n 
cD = TT Pe 4 ( Il Pe +f, (51) 


i=0 s=0 \i=s+1 
Note in (51) the matrices P; do not commute so that a product IT P; 


means in the order P,, --- PoP. 

We proceed to examine (51) in more detail. We remark first that, by 
their very definition, P,, and f” depend on the data variables {an, Qn+i, 
+++, An+m-1} [see (11), (15), (48), (45)], and thus et”) depends on the 
entire sequence {a;}2s“"'. If we formally average (51) making use of 
the stationarity of the basic Bernoulli sequence {a;}, we have 


Ee) = (« I Pe” +) (z I Pe”), (52) 
1=0 s=1 i=] 

the expectation being taken over all binary variables which enter (52), 

namely, ao, Qi, --+ , Qn+m-1. The first term of (52) represents the decay 

of the initial error to zero (the transient); the second term is the forced 

response, causing a small but nonzero steady state error as n > ©, 
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We have not been able to work with (52) directly, and at this point 
our analysis takes a crucial turn. We average (51) again, only this time 
we do not average over all the binary variables which enter but only 
over the sequence do, a, «++, Qn. Call this conditioned average E,. 
Then 


n n-1 n 
E,e"*) = (x, II Pe” +> (, Il Pf”) + E,f”. (53) 
1=0 


s=0 i=s+] 


Now, however, (53) is not one vector equation but 2”~' of them, since 


it is valid for any sequence of values of {@nii1, +++, Qn+m-i}; these 
variables appear in (53) for arbitrary values. Thus the set of values 
just mentioned form a “super-index” which we may collectively call -J, 
J taking 2“ values. For example, we might choose to call (for M = 
3) the values {+1, +1} to be J = 1, {+1, —1} to be J = 2, {—1, +1} to 
be J = 3, and {-—1, —1} to be J = 4. For the moment, however, the 
precise mapping from the (M — 1) binary variables to the integer J is 
unimportant. 
We also want to consider the matrix 


P, =I -— aX™XT (54) 


not as a N X N matrix, but as one consisting of 2%"! x 2”~' blocks of 
N X N matrices so that it may act in (53) as a transition matrix 
between vector blocks. 

Thus in (54) P,, is determined by X™’, i.e., from (1), by 


Qn 


Qn+1 


X™=By : (55) 


Qn+M-1 


Hence the “super-index” J corresponding to the vector result of an 
operation by P, would be the last M — 1 components of a’, namely 
(Qn+1, An+2, ***, Antm-1). On the other hand, P, acts on a quantity 
determined by 


xe@ey =B ; (56) 


QAn+m-2 


that is, something with vector index J’ = (Gn, +++ , @n+m—2). Thus if we 
call 


I — aX”X™T = K(J, J’), (57) 
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K(J, J’) can only act between index pairs (J, J’), which are “shift- 
compatible.” Thus if 


J <> (S81, °° * 5 Sm-1) 
J’ (th, +++, tn-1), (58) 

where the s; and ¢; are binary variables, then 
K(J, J’) =0 unless 5; = tis, t=1,---,M—2. (59) 


On the other hand, if (J, J’) are shift-compatible, this is enough to 
determine the appropriate X". Thus with (58), (59), 


X”=B (60) 


and we use (57) to define the appropriate K(J, J’). Having, in the 
manner thus described, achieved the block structure (57), we define 
the N x 2"! dimensional square matrix 


K(I1, 1) K(1, 2) -+- K(1, 27!) 
; . : 
A(a) = ‘ : (61) 
K(2™")) 1) 


There are, in fact, in any row of (61) only two nonvanishing blocks. 
Summing over the row thus corresponds, because of the factor of 2 in 
front, to averaging over the first component a, of A”. 

We write any N vector which is further labeled by our block index 
J [v(J), say] as an N x 2" vector V 


v(1) 
v(2) 
V= : (62) 


v(2™"') 


To tie this all together, it is now easy to convince oneself that, if we 
let V,41 correspond to E,,€*” as in (62) and, similarly, let F correspond 
to E,,f”, then, by making use of the stationarity of the averages which 
appear, (53) represents the solution of the equation 


Vier = Al(a)V,+ F (63) 
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with initial condition 
Vo=f | =[e™). (64) 


In (64), the notation [v] has been introduced to represent an n-vector 
“stacked” 2”! times. 

The solution to (63) and (64) contains all the information we want. 
In fact, once V, is known we, by definition, know E,-,€ (J), where 
we have modified the notation slightly to make explicit the dependence 
on Jd < (nai, °** , Qn+M-1). To regain e”), we simply average: 

QM-1 


1 
é” = E[Ene” (J) = 59 D Ene” (J). 169) 
J=) 


The average in (65) can be put in another form if we introduce the 
matrix} 


IITI-.-- 
IIdTI-.-- 
1 é 
P, = QM-1 ? ’ (66) 
III... 
having each N X N block equal to the identity matrix. Then 
[fe] = PiV,. (67) 


We may already note that P; is an orthogonal projection operator (Pj 
= P,, Pf = P,) and (67) thus states that [e”] is a projection of V, into 
an appropriate subspace. Further, note that 

[Ef] = [0] = PF (68) 


and thus F belongs to the orthogonal subspace. 
The formal solution of (63) (including the final projection) is 
n-1 
PV, = PiA"(a)[e] + Pi ¥ A*(a)F (69) 
s=0 
having the limit 
P.V.. = P,[I -— A(a)]"'F. (70) 


{+ We hope a warning that the symbol P, is being used for different things in (66) and 
(48) will eliminate confusion. 
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Both (69) and (70) can be computed using the spectral decomposition 
for functions of a matrix A. If A has all its eigenvalues A; of index one, 
that is, if its eigenvectors U; span the space (all Jordan blocks one- 
dimensional) and if W; are corresponding eigenvectors of A*, chosen 
so that 


W?U; = 8y, (71) 
then for (almost) any function h(-), 


h(A) = ¥ AA)UW?. (72) 


Roughly, h(-) is restricted so that A(A,) is defined. A similar but more 
complicated theorem holds if the U; do not span the space. If a # 0, it 
may be reasonable to assume that the U; do indeed span the space, 
but for a = 0 they do not. 

We may already note that asymptotic stability of the full-fledged 
algorithm is guaranteed if all eigenvalues of A(qa) are less than unity in 
magnitude. In fact, only those eigenvalues which are associated with 
a U; such that PU; ¥ 0 need have magnitude less than unity. 

In general, because of the very large dimension (N2™~') encountered 
in practical use, the above theory would be more useful if workable 
approximations could be found. We present one such approach in 
Section VI which is based on a perturbation approach for small step- 
size a. Before doing that, we retreat a bit to demonstrate how the 
mean-square error may be brought into essentially the same form just 
developed for the average error vector. 

We again find it more convenient to discuss the error matrix R”? 
defined in (29). We substitute (50) directly into (29) and perform our 
trick of taking the average EF, (which involves averaging only over ao, 
1, °*+, Qn leaving Qn4i1, +** , Qn+m-_. fixed) to obtain 


1 
E,R®*” ==Y P,(E,iR™)P, + 3 Ie aes ca 


bol = 


+ 5 >; (P,,(E,-1e™ )£™ + f”(E,-1€)7P,,). (73) 


In (73), )ia, refers to summing over a, = +1. Note that in (73) the 
sequence of quantities E,-,«” may be regarded as known (or calcula- 
ble) since they are the N dimensional subvectors which make up the 
N x 2™~! dimensional solution V,, to (63) and (64). 

We will rewrite (73), but first we need some notation. If R is any N 
x N matrix, we may make an N? dimensional vector out of it by 
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writing the quantity 


Ri 
Rie 


Rin 
Ray 
Roe 


qRy= | "|. (74) 


Rn; 


Ryn 
We call €(R) the vector made out of R. 
In this trivial sense, we use €(-) as an operator. We use this to turn 


some of the terms in (73) into vectors. Introducing the ‘“J-index” for 
emphasis (it is, of course, implicit when we use E,,) we define 


w" (J) = ER’ (J)] (75a) 
g(J) = (Eff) (75b) 
(Vn, J) = (Eff? + E,PreO£”? + Efe 7P, J. (75c) 

Next we note that if A, R, and B are N X N matrices, then 
(ARB) = C&(R), (76) 


where C is an N? x N? matrix. In fact, C is the direct product A ® B” 
where A ® B (not A ® B”) is given by 


anB ae2B +--+ aywB 
a1aB ax2B +++ aenB 
A@B= ; (77) 


amB an2B --+ annB 
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In the notation (75) and (77), (73) may be rewritten as 


1 
wD J) = 5 YP, ®@ Prw™(J’) + g(Vn, J). (78) 
In (78) J’ is the compatible pair of indices that are allowed with J. As 
in (63), we form N? x 2™”~! dimensional vectors W,, G, and G(V,) 
from w"(J), g(J), and g(Vn, J), respectively. And finally, using the 
definition of K(<J, J’) in (57) to (60) we write 


K(1,1)® K(1,1) K(1, 2) @ K(1, 2) --- 


1 
B(a) = 5 ’ (79) 
K(2™"1,1)@ K(2™”"1,1) ... 
The collection of equations (78) reads 
Wr+i = B(a) W, + G(V,). (80) 
Equation (80) with (63), (64), and the initial condition 
€(EM EO) 
€(e°e 7) 
Wo = (81a) 
z(e%) 


provide an exact description of the error matrix. 
To simplify matters, we replace (80) by the approximate version 


Wr+i = Bla) W, + G, (81b) 


where G, as already defined, is formed from (75b) as G(V,,) was formed 
from (75c). When more is understood about the solutions of our 
equations, we see that the replacement of (80) by (81b) is not a serious 
matter.t 

Again, we are not interested so much in W,, as the projected version 


[é(R”)] = PiW,. (82) 


In (82) the bracket notation is the same as (64) except that ¢(R™) is a 
vector of dimension N? instead of N. Also, in (32) P; has the same 
meaning as in (66) except that the identity matrices are all in N’ 
dimensions instead of N. 


+ In most situations, G(V,,) is small compared to the initial error and the associated 
transient. The main effect of the forcing term is to give a nonzero error as n — ©. But 
V, — 0, and G(V,.) reduces to G. 
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In summary, we thus see that both eq. (63) for V,,, which represents 
E,—1€™, and (81b) for W,, which represents E,n-:.R™, have the form 


Vnoi= LAV n+ F, (83) 
with 
Yo = [4], (84) 
and the quantity of interest being 
PAY, (85) 


for the appropriate dimension and projection 7. 


V. THE CASE a=0 


The equalization problem is uninteresting when the step-size is 
taken to be zero, i.e., nothing happens. However, since we soon intend 
to do a perturbation analysis about a = 0 we must be familiar with our 
formalism when a = 0. This is not trivial, and we devote this section 
to it. 

To display matrices explicitly, we need a labeling procedure. We let 
the “super-index” J run from 0 to (27! — 1).¢ The J value which 
labels (ai, --- , @u-1)(a; = +1) is gotten as follows: Change +1 to 0, 
and —1 to 1, obtaining then binary representation of J. Thus, for M 
= 3, J = 0, 1, 2, 3 correspond respectively to (+, +), (+—), (—+), and 
(—, —). With this labeling we have 


1 1 
1 1 
01 0 O1 0 
01 01 
l 001 001 
A (0) =F 001 001 OIl=Tel. (86) 


000 --- 1 00 --- 1 
000 --- 1 00 --- 1 


Let S be vector space of dimension N or N? accordingly as ”, in 
(83) refers to V, or W,,. Then in (86) J refers to the identity in S. 

The matrix .7(a) has the same structure as (86), with each identity 
being replaced by the appropriate J — aXX7 or (I — aXX7) ®@ (I - 
aXX7*), 


+ This labeling is gOr descriptive convenience here. We hope the reader is forgiving if 
we later let J = 1, 2, --- ,2”~'. We will be explicit about the convention when it matters. 
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Table I—Zero eigenvalue 
structure of T° 


# Blocks of 
Index This Index 


1 


1 
2 
4 
8 


SSSR 


2'-2(1 > 3) 
‘—(M-2)=2 9 ates 
1 


QM-3 


aa 


The matrix I in (86) is basic to our study and we now concentrate 
on it; it has dimension 2”. Clearly, the all-ones vector is an eigenvec- 
tor of I having eigenvalue one. The reader may convince himself that 
I" is proportional to the matrix consisting of all ones, which has 
(2“-! — 1) eigenvectors perpendicular to the all-ones vector. These 
eigenvectors are associated with eigenvalue zero. Using the fact that 
the eigenvalues of a power of a matrix are the powers of the eigenval- 
ues, we conclude that I has one unity eigenvalue and (2”~! — 1) zero 
ones. The zero eigenvalues are not of index one however (index, recall, 
is the dimension of the Jordan block). Table I summarizes the structure 
of the zero eigenvalues of I. 

While it is not crucial for the sequel, we also give the eigenvectors 
and generalized eigenvectors of I. These are the columns, albeit 
permuted, of Hadamard matrices H,, constructed according to H2 = 1, 


HA, H,, 
Ae, = = Hh.,. (87) 
H,, —H, 


Rows and columns of H,, are labeled from 0 to n — 1. Our claim is that 
the columns of H(2” — 1) are the (unnormalized) generalized eigen- 
vectors of I’. Recall that a sequence of vectors x, / = 1, --- , k forms 
a chain of generalized eigenvectors corresponding to a k-dimensional 
Jordan block when 


DX) = Xi41 l=1,---,k-1 
TX, = XXz. 


Clearly, the last 2“? columns of H satisfy 'X; = 0 and these are the 
only ones. If c; is the Ath column,'2”~? + 1<& = 2”"' then the chain 
that ends with it is, in reverse order,f 


+ For (88) to hold, it is essential that the first column be labeled co. Also, of course, 
the c, of this section is different from c; in Section II where it signified equalizer taps. 
No confusion should arise. 
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(Ck, Ck/2, Cr/4, **°)- (88) 
These notions may be verified for 


Te ke ae 2 ae 
=]. l=). tel] t=] 
Label 1. Leb St 
Sh sal, - th. - sek) +1 
Tok faeh=l=f--—1 
ak. eho t= 
feel ele het, 2 fT 
-l1-1 1-i 1 l1-=-Il 


(89). 


en 


The chains are (4, 2, 1), (5), (6, 3), (7). If we rearrange the columns of 
Hg to give 


He = (Co, C5, C7, C6 C3, C4, C2, C1), (90) 
then 
1 | 
0 
0 
| 01 
010 
001 
000 


From the direct product structure in (86) we conclude that if ®; are 
a complete o.n. set for S, then the generalized eigenvectors of .7(0) are 
1 
ae 
cr being the columns of the Hadamard matrix just described. In 


particular, ./(0) has N (or N’) unity eigenvalues of index one, having 
eigenvectors 





Cr ® ®,, (92) 


®; 
Q; 
1 ; 
U; = Voit . ; (93) 
0; 


the remaining eigenvalues are zero. The projection operator onto the 
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space spanned by the eigenvectors having A = 1 is, using (93), 


YUU =A, (94) 


where , has already been introduced in (66), J being the identity of 

S. Since Y; = 1, the projection is orthogonal. We call the projection 

onto the “zero eigenvalue subspace” of ./(0) by Ao and Ap =I — Py. 
Thus, when we solve (83), we really desire, according to (85), not Y, 

but Y,/,, its projection onto the unity eigenvalue subspace of .#(0). 
A standard spectral representation of .(0) is 


AL(0) = Pi + Do, (95) 
where J}! = 0. This defines (for us) Do. It may be shown that A; Do 
= 0. 

We remark here that our basic equalization problem is unchanged 
if any infinite sample sequence of data values {a,} is replaced by their 


negatives. This follows from the quadratic nature (in the a,) of the 
algorithm (19). As a consequence, we have 


E,eé” (Io Si, °°, Sm-1) = Ee” (Jo Si, °°", —Sm-1) (96) 


and similarly for w”(J). We have not exploited this symmetry, but if 
we had, the dimension of ./(a) could be reduced by a factor of 2. (0) 
would then, in particular, have a different form, but would have many 
of the same properties discussed here. 

Finally, we take this opportunity to get some notational problems 
out of the way. We introduce a convenient way of labeling matrices C 
with block structure as in (86). Label rows by p, p = 1, 2, «++, 2!—) and 
likewise columns by »v. If we write 


p=(i-Intk 185ij752 
v=(j-l1)n+4+l 1<k,l1<N (or N’) 
n=WN (or N°”), (97) 


then the pair (i, 7) specifies which block we are concerned with, while 
the pair (k, 1) are the usual matrix indices for the N x N (or N’ x N’*) 
matrix in that block. Thus, for example, in (77), 


(A © B),y = aybui. (98) 
Likewise, in (92) the vector c ® ® has components 
(c @ @), = c; D,. (99) 


The orthonormal basis for S where the kth basis vector has a one in 
the kth position and zeros elsewhere is denoted by {e;}. 
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Vi. THE PERTURBATION THEORY 


We begin the next stage of analysis by writing our matrices in a new 
basis. Consider the orthogonal transformation matrix U which brings 
(0) to Jordan form, namely, the matrix U whose columns are of the 
form 





c; @ ex, (100) 


9M- 


where c; are columns of the Hadamard matrix of appropriate dimen- 
sion, and e, are the basis vectors of S. In (100), i and k range over all 
possible values. The columns of U are assumed to be arranged so that 
the result on ./(0) comes out “nice.” We will not bother to be too 
explicit, except to say that the first N(or N’) columns of U are ~ 





co@e, k=1,---,N(N”). (101) 
9M-l 
Thent 
UTd (a) U = f | Gy. (102) 
In (102), B is an N X N matrix, v is N x (2“7! — N) matrix, etc. If a 
= (0, (102) takes the form 
I 0 
E aI (103) 


being a Jordan block exemplified by (91), i.e., “nice.” Note that 9° 
= 0if/=> M-1. 

In general, when a + 0, all blocks in (103) have added terms which 
are linear in a, or linear and quadratic, depending on whether (61) or 
(79) applies. 

We shall be especially concerned with the matrix £, for it is here 
that the germ of independence theory appears. To calculate it, we 
want 








Bru = = Co ® e (a) Co ® ei} (104) 


9M-| 
Calling the (m, n) element of the (i, 7) block of (a) by %6%n, (104) 


becomes 


+ Henceforth, we denote transformed quantities by a tilde. 
+ The reader should note that the simple equations (27) and (28) suggest that the 
linear and quadratic a-terms are of equal importance for ranges of «a of interest. 
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1 i 
Bat = 5x1 2, (Co)i(€x)im Ginn (Co) (C1) x. (105) 


mn 


Now 64, = 0 whenever I’; in (86) is. Thus, for fixed mn there are only 
2” possible 6%, which are nonzero. Denote the sum over these as 
nonzero. Then (105) becomes, using (€:)m = Skm, (€o)i = 1 


1 Bs 
Bri = 3M YY Oh. (106) 
Equation (106) gives 8, as the average of the (k, 1) elements of all 2” 
blocks in .7(a) which are not a priori zero. This, however, is nothing 
but 


E(UI — aXX7) =I- aA, (107) 


precisely the matrix which enters in the independence theory! Like- 
wise, if .7(a) = B(a) 


EU — a XX‘) ®@ (I— aXX") (108) 


is the matrix by which we would solve independence theory had we 
rewritten (31) giving R™ its vector form rather than its matrix form. 

What do vectors look like with our new o.n. basis? If ” is a column 
vector of numbers in the original basis, then in the new basis the 
numbers are U7¥. Let VY be considered as blocks of N(N7’) vectors 
®', the kth component of each is ®,. Then the inner product of a 
particular row of U with /”, namely, 

> (ci X Cr) 


Be 


is a generic term of U” y which evaluates to 





1 QM-1 
Wa p (Ci); (109) 
QM=-i imi 


Thus the first N(N?) components (the first blocks) is simply ¥2”~! 
times the average of the blocks of /. In other words, 


® 
® ® 

"1 _, yg (110) 
: 0 

® 


The right member of (110) is, of course, written in a notation compat- 
ible with (102). Likewise, a vector with zero average transforms to a 
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vector which may be written 


0 
ian ; (111) 
@ 


Thus the initial condition for (63) or (80) is of type (110) unlike the 
driving term for (63), which is of type (111). The driving term for (80) 
has both types. 

Finally, we note that the projection operator onto the unity eigen- 
space of . (0) is 





I 0 
0 0 
while 
0 0 
Po — 3 (113) 
0 q 
It will also be convenient to write 
a (oa 
UY, =V,= : (110b) 
/9M-1 Yn 


Putting together the pieces just described in this section, the contrast 
between the mathematics of the exact theory and independence theory 
is as follows. The former problem is the following: solve for x, where 


Xn+1 B v\| xn ® 
= + ; (114) 
Yn+1 Y 6 Yn VY 
where xo is given, yo = 0. The latter problem is: Solve for x, where 


Xn+1 = Bxn + ®, (115) 


Xo is given. Note if v and y in (114) were zero, the solution to the two 
problems would be identical. Since v and y vanish when a = 0, we may 
hope a perturbation approach will be useful for small a. More specifi- 
cally, we treat 


B-TI y 
Y oa 
as a perturbation of (103), the matrix A(a) when a = 0. 
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We begin by considering the eigenvalue problem for (a). When a 
= 0, the eigenvalues of 8 are unity while these of 6 are zero, and these 
eigenvalues vary continuously as a is increased. Consider solving for 
the large eigenvalues. In general, we have to solve 

Bx+vy=Ax 
(116) 
yx + dy = Ay, 
where AJ is one of these eigenvalues, presumed close to one. Since the 
eigenvalues of 6 will be presumed smaller than A, (A — 5)~' exists and 
we conclude from the second equation of (116) that 


= (A — 6)""yx. 


Substituting this into the first equation yields 





[e+e toy|emas (117) 


Consistent with the perturbation spirit, we replace the A (on the left) 
by 1 and 6 by its value when a = 0, namely, Y [see (103) ]. 
Thus the large \’s are (approximately) solutions to 


Bae ral fees (118) 


and the corresponding eigenvector to .(a) is, approximately,f 


x 
(119) 


jae 


Using these approximations and applying the spectral decomposition 
discussed in (72) to evaluate .¥”(a), it is now straightforward to show 
that the desired solution to (114) is, at least if we neglect the small 
eigenvalues. 


ma | Pte p ts g 1| = 


n-1 


1 : if 


+ With the present representation, the perturbation theory has been painless. More 
formal and more thorough approaches to perturbation theory of matrices may be found 
in Refs. 2 and 3. 
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From (120) we conclude 


lim x, =[I-B-—» 


1 
BNE I- g = 


In fact, the steady-state error can also be computed exactly from (114) 
as 





y]"[® + »—— _¥]. (121) 


[I-B-» vy [e+ 








Tx5 xs Vi. (122) 
Within the spirit of our approximations, (122) is consistent with (121). 
The neglect of the small eigenvalues is justified by the fact that their 
contribution will damp out quickly, and also that they operate in a 
subspace approximately orthogonal to the one we are interested in. 
Thus in (119) the “second half” of the large eigenvector is small 
because of the y factor. The corresponding form for the “small” 
eigenvectors would have the first portion small. 
We take (120) and (121) as our approximate solution. The terms 


1 1 
d »v——v 123 
a ea 3 L£ Y an v I = L£ ( ) 
are higher order terms in the perturbation, and neglecting them we 
obtaint 


= Bm + ¥ po (124) 
s=0 


1 
o =—— O, 125 
| eae ea (125) 
exactly what independence theory would predict. 
To examine further the key el 


.+o—— (126) 


= “FZ m 
some more concrete expression for the vy type terms is needed. For 
example, consider an initial error matrix Ro. Then 


= E[I — aXiXT|Ro[I - «Xi XT]. (127) 
This must correspond to £ and so, as we already know, 
B = E{I -— aX:Xf] ® [I — aXiX7]. (128) 


In general, then (neglecting the forcing terms), independence theory 


+ Noting that (A + e)" = A" for n = O(1/e) but not for 2 — o, we expect the 
approximation to break down after a while. This may very well happen only after the 
taps have, for practical purposes, converged to the desired solution. 
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can be written (letting P, = I— aX,X2) 


Ras = EP,RpP3. (129) 
If we consider two iterations 
Re = E[{I — aXoXF][I — aX XT] Rf[I- aX XT [I -— aX, X7], (130) 


this corresponds, on squaring the matrix in (102), to B? + vy. Thus vy 
corresponds to 


E[P2PiRoPiP2 — P.ePiRoP:P.]; (131) 


where P., is simply a notation denoting that it (P..) is to be treated 
independently of Pi. The matrix Ro is not statistical. The proper way 
to write (131) ist 


vy = E[(P2 ® P2)(P; ® P:) — 6’). (132) 


In general, it can be shown 


M-2 


M 
yp — y= FS vf'y = ¥ [E(P.® P,)(P,® P,) — B’]. (138) 
I- Ba s=2 


s= 


Using (133) in (126) provides us with the next correction to the 
eigenvalues by way of (118). 
Furthermore, (133) suggests a simplified “dynamics” for R,, namely, 


M-1 


Ris = EP\R,Pi+ EY [PissPiRn-sPiPi+s 
s=1 
— PEPi hPa; (134) 


A general discussion of these correction terms seems out of the 
question. In fact, the expectations are not trivial to do. Instead, we 
resort again to the simple model of (15), where A = J, and#\ = tr R,, 
and set N = 3. For this case, we have been able to do the expectations 
and compute the eigenvalues of B and B + »[1/(I — Y)]y. The 
eigenvalues results are given in Table II. Certainly, in this case the 
perturbation philosophy seems well justified. 


Vil. CONCLUSIONS 


We conclude (as explained above) that a perturbation analysis 
suggests that the difference between independence theory and one 
which takes into account the correlations between the “gradient” 


f Using (A ® B)(C ® D) = AC ®@ BD, other forms are, of course, possible. 
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Table II—A comparison of 
the eigenvalues of 8 and 
its perturbation for a 
special situation 

1 


+ yp 
B as eg 
0.667 0.674 
0.555 0.543 
0.555 0.555 
0.555 0.555 
0.333 0.337 
0.333 0.333 
0.333 0.333 
0.333 0.333 
0.333 0.333 


directions is slight. Our early worry was that the shifting property 


xy x2 
Xo x3 

>| (135) 
Xn Xn+1 


in going from one gradient direction to the next could cause trouble 
with independence theory. Any notion that this particular dependence 
must result in mathematics completely foreign to that of independence 
theory has been shown to be false. Independence theory is an inherent 
part of the exact description. 

The situation in (135) does, however, have the rigorous property 
that the “new” component (x,4+1) is independent of the others. For real 
problems, this situation may well be violated in certain cases of severe 
intersymbol interference. Examining the N = 1 case leads us to propose 
the following criterion to measure this dependence. Namely, if, in the 
synchronous case, the received pulse h(t) [see (1)] is normalized so 
that 0”. hi = 1, then we might expect 


_ © 2 
> ( > hah) <1 
s=1 \l=—-a0 

to be a good measure of independence for the new component. 

Our effort has been a long and tedious one, and our attempts to pull 
insights from complicated equations have sometimes been nonrigorous 
and no doubt occasionally colored by the previous experimental results 
and simulation results of others.' Thus, while the ultimate justification 
of independence theory must remain empirical, we hope that our 
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efforts at least make mathematically plausible the successes of inde- 
pendence theory. 

Finally, it is a pleasure to say that the present work has benefited 
from discussions with J. Salz, L. A. Shepp, N. J. A. Sloane, and H. S. 
Witsenhausen. 


APPENDIX A 
Evaluation of Some Averages 


For the purposes of this appendix, we drop the superscript in (11), 
labeling things as if n = 1. For application to (31) we consider the 
average 


EXX™TRXX* (136) 


for an arbitrary N x N matrix R. Here (11) holds, and we are averaging 
over the binary variables in a. Expanding (136) using (11) we have to 
do the key average 


Eaa’Qaa’ = EC, (137) 
where Q = B’ RB. Thus from (137), 
(EC) = EY (aa’)inQru(aa’),; 
il 


=K > Aj;Apn01A; Qari. (138) 
kl 


Using the fact that for independent binary variables 


Eaja,aia; = §in81; + budnj + S181 — 26:8 08i; (139) 
we obtain, upon using (139) in (138) 
(EC)i; = Qi + Qi + (tr Q)6i; — 2Qidi;. (140) 
In matrix notation, (140) becomes 
EC =Q+ Q" + (tr Q)I — 2 diag Q, (141) 


with the definition 
(diag Q)i; = (Qii)di;. (142) 


Note that if the a; were unit-variance Gaussian, the last term in (141) 
(diag Q) would not arise. It will be dropped because it is small in usual 
cases. Finally, multiplying (141) on the left by B and on the right by 
B’ we recover (136), obtaining (since Q is symmetric now) 


EXX™RXX’ = 2 ARA + (tr RA)A — 2B(diag B"RB)B*. (143) 
Now we recall that all terms in (143) are multiplied by a”. We would 
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neglect them all unless one can be large. In fact, (tr RA)A can be N 
times larger and hence this is the only term we need keep. 
We move on to consider (32), rewritten as 


—Ea(I — axx™)e[e’Txx7 — Ans sX7]. (144) 


The term linear in a in (144) vanishes as a correspondence of (6), (7), 
and (9). One of the a” terms is 


ao’ EXX Tec 7 XX. (145) 


Evaluating (145) using (143), we check to see if the dominant term can 
be large. It is given by 


Atrec7A = A(e?Ac*). (146) 


If we introduce the (M — 1) vector u, having all zeros except a one in 
the (1 + J) place, then 


Qnvg = U-a=u’a, (147) 


and the other a? term is proportional to 


wo’ EXXeu’aX? = a? EB[aa™(Bteu’)aa’ |B". (148) 
Evaluating (148) using (138) and (141), we get 
A(e"Bu). (149) 


However, using (7) we readily verify Bu = v, and a final use of (9) 
shows that the two dominant a” terms (146) and (149) cancel. The 
other terms are truly a? terms (as opposed to a?N) and are neglected, 
leading us to replace (32) by zero. 

We have introduced enough tricks now so that the reader may easily 
reproduce (33). 


APPENDIX B 
Definiteness of Solution to (34) 


We give here an explicit demonstration that the solution to (34) 
retains its positive definite character. By induction on 7, it is sufficient 
to show that R*” is positive definite (= 0) if R™ is. 

We make repeated use that R = 0 if R is hermitian and $’R¢ = 0 for 
any vector @. 

We recall A = 0 (and therefore hermitian) and hence R*” is 
hermitian. 

Rewrite the right member of (34) as 


(I — aA)R(I — aA) + a [A tr AR — ARA]+ 0°&*A. (150) 


Each term in (150) is positive definite; the only nonobvious one is the 
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second. However, it may be rewritten as 


VA [tr VA R VA — VAR VA] VA, (151) 


since tr AB = tr BA. The matrix VA R VA is, of course, positive 
definite. Now observe that if B = 0 then tr B — B= 0. This concludes 
the proof. 
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Transmission Properties of Various Styles of 
Printed Wiring Boards 


By A. J. RAINAL 
(Manuscript received October 18, 1978) 


This paper presents some experimental results concerning the pulse 
transmission properties of fine line printed conductors (e.g., width = 
8 mils, spaces = 9 mils) on various styles of circuit packs (cps). The 
pulse transmission properties include the characteristic impedance, 
the propagation delay, the rise time, the bandwidth, and the intra- 
layer and interlayer pulse crosstalk. A simplified theoretical model 
is presented which leads directly to some basic crosstalk equations. 
Theoretical results are developed to extend the application of the 
experimental crosstalk results to arbitrary pulse signals, periodic 
signals, and random signals. Also, theoretical scaling laws are 
developed to extend the crosstalk results to conductor spaces in the 
range of 7 to 40 mils. The crosstalk results are very important, since 
they tend to limit the packaging density of printed conductors on the 
cp styles by limiting the coupled length and spacing of parallel 
conductors. The results can be incorporated into computer-aided 
designs which can analyze routed cps to detect potential crosstalk 
problems before the cp routing is finalized for manufacture. Other 
applications include cp selection, crosstalk estimation, electrical 
characterization of cps and backplanes, estimation of conductor 
capacitance and inductance, and effects of various dielectrics. The 
results are applicable to general styles of printed wiring boards. In 
particular, they apply directly to all styles of cps in the BELLPAC™ 
apparatus housing—a modular packaging system for packaging elec- 
tronic equipment in the Bell System. 


1. INTRODUCTION 


In the physical design of large electronic systems, the interconnec- 
tion of the integrated circuits and other components at the circuit pack 
or printed wiring-board level constitutes a basic and relatively expen- 
sive level of interconnection. In addition to supplying power and 
ground, the circuit pack (cP) provides the conductor paths for the 
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Table |—Description of the circuit pack styles 


Circuit 
Pack 
Style 


Wire wrap 
Extender board 


Double-sided 
(epoxy) 

Double-sided 
(metal) 

Bonded board 
(LAMPAC){ 

4L MLB (ExT P/G) 

6L MLB (ExT P/G) 

6L MLB (INT P/G) 

6L MLB (INT P/G, 
surface routing) 

8L MLB (INT P/G) 


Description 


Wire wrap board for breadboarding 

6 layer MLB, 2 pad layers, 2 signal layers, power (P) and ground 
(G) on inside, dedicated ground conductor between every 
pair of signal conductors 

Double-sided, epoxy PWB 


Double-sided, metal core, PWB 

Flex bonded to epoxy coated steel 

4 layer MLB, 2 signal layers, P and G on outside 

6 layer MLB, 4 signal layers, P and G on outside 

6 layer MLB, 2 pad layers, 2 signal layers, P and G on inside 
6 layer MLB, 4 signal layers, P and G on inside 


8 layer MLB, 2 pad layers, 4 signal layers, P and G on inside 


+ This particular bonded board is also known as LAMPAC. 


transmission of pulses and other types of signals between the inte- 
grated circuits, other components, and the cP connector. 

The basic pulse transmission properties, such as characteristic 
impedance, propagation delay, rise time, bandwidth, and crosstalk 
depend a great deal on the cp configuration or style. Since the costs 
associated with the various cP styles differ significantly, it is very 
important to develop cp styles which are suitable electrically and 
which are relatively inexpensive. 

For the past few years, a Bell System packaging effort’ (BELLPAC* 
packaging system) has been under way to develop a modular packaging 
system for packaging electronic equipment. This effort makes use of a 
suitable connector (963) and a number of cp styles that have common 
features suitable for computer-aided design. 

The purpose of this paper is to present some basic transmission 
properties of various styles of cps which include those in the 
BELLPAC hardware family. The transmission properties are very 
important, since they help to determine which cp style is most appro- 
priate for a given application. 

A listing of the cp styles along with a short description of each is 
presented in Table I. Copper conductors are used on all the cP styles. 
The dielectric material for most CPs is a composite of epoxy and glass 
fibers. The composite structure has a relative dielectric constant 
(effective) of about 4.2. Except for the extender board, all the cp styles 
have the common features shown in Fig. 1. 


* Trademark of Western Electric. 
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Fig. 1—Some common features applying to all circuit pack styles except the extender 
board. The plated-through holes are on either 100-mil or 200-mil centers. The spaces 
between the rows of plated-through holes are denoted as 100-mil or 200-mil channels. 


The extender board is a very special design. Its primary function is 
to extend any cp beyond the apparatus housing so that both sides of 
the cp are accessible for debugging or test purposes. Thus, the extender 
board is basically an “extension cord” for a CP. 

To determine the basic transmission properties of the various CPs, 
appropriate test boards were designed for each style of cp listed in 
Table I. Except for the double-sided (metal) board, all test boards were 
fabricated at the Western Electric printed-circuit manufacturing plant 
at Richmond, Virginia. The double-sided metal board was manufac- 
tured at the Western Electric plant in Kearny, New Jersey. The test 
routing consisted of either 8 + 2 mil conductors with nominal 9-mil 
spaces or 12 + 3 mil conductors with nominal 13-mil spaces. The 8-mil 
conductors were on 17-mil centers, and the 12-mil conductors were on 
25-mil centers. In general, the length of the conductor paths was about 
1 foot. 

An experimental approach was necessary for this study because 
detailed theoretical models which include all cp styles of interest 
become very complicated, and they are not now very useful for 
determining many basic pulse transmission properties. The experimen- 
tal methods used to determine the transmission properties of the test 

‘boards are described in the next section. 


PRINTED WIRING BOARDS 997 


il. DESCRIPTION OF THE EXPERIMENTAL METHODS 
2.1 Pulse transmission properties 


Each cp style containing the test routing was probed with a Hewlett 
Packard time domain reflectometer (TDR) system consisting of a 1815A 
sampling plug-in, an 1817A sampling head, and an 1106B tunnel diode 
pulse generator. The TDR system was used to apply a fast rising step 
signal into each cp and display the reflected waveform on a sampling 
oscilloscope. In general, the conductor path on the CP was open- 
circuited and was free of any parallel branches. 

For purposes of detailed analysis, a photograph was taken of each 
TDR display of interest. The general form of the TDR display is pre- 
sented in Fig. 2. By analyzing the TDR display of the reflected wave- 
form, one can determine the basic pulse transmission properties of the 
various CP styles. The particular cp properties of interest are the 
characteristic impedance, Z), the propagation delay, Tu, the 80-percent 
rise time, 7,, and the bandwidth, B. All these cP properties can be 
determined by analyzing each TDR display as indicated in Fig. 2. The 
80-percent rise time, T,, on the TDR display is a result of the input step 
signal traversing the cp twice, as is characteristic of a reflection 
method. The one-way rise time is faster by a factor of approximately 
1/2. By applying this factor to the usual relationship between band- 
width and rise time, we have 

0.35 . 1 


= T,/ V2 27, 


Reference 2 presents some additional discussion concerning the TDR 
method along with some detailed results concerning the theoretical 
TDR display for an ideal cP. 
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Fig. 2—The general form of the TDR display. 
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2.2 Pulse crosstalk properties 


The pulse crosstalk properties of the cps were determined experi- 
mentally by using the method described in Refs. 3, 4, and 5. Briefly, 
the method consists of applying a fast pulse (rise time ~ 2 ns) to a 
driven conductor and monitoring the resultant waveform at the near- 
end or far-end of some idle conductor of interest. In all cases, the 
crosstalk results apply when all conductors are properly terminated 
with matched loads. The corresponding results for other loads can 
yield higher values of crosstalk which can be estimated from the results 
for matched loads by determining the reflections and using superpo- 
sition. Thus, the crosstalk results for the matched loads are basic 
properties of the cp styles. The crosstalk results are very important 
since they limit the packaging density of printed conductors on the 
cps by limiting the coupled length and spacing of parallel conductors. 

We now summarize all the experimental results presented in the 
appendix to this paper. 


lll. SUMMARY OF THE EXPERIMENTAL RESULTS 


Table II presents a summary of the pulse transmission properties of 
all the cps considered in this paper. More detailed properties for each 
of the cp styles are presented in the appendix, as stated in the last 
column of Table II. 

The propagation delay per foot, the rise time, and the bandwidth 
include the effects of the 963 connector plus fanout (see Fig. 1). 
However, an earlier study® has shown that the 963 connector plus 
fanout limits all the cp styles to applications having one-way signal 
rise times (= 1/V2 of the TDR rise time values) no faster than about 
2.0 ns (bandwidths <= 175 MHz). This 2.0-ns limit was determined by 
considering the crosstalk levels and impedance mismatch associated 
with the 963 connector plus fanout. This lower limit on signal rise time 
is sufficient to include most applications in the Bell System. 

The pulse crosstalk results were measured as a percentage of the 
signal step in the driven conductor. The crosstalk results apply when 
the printed conductors are terminated with matched loads. 

The interlayer crosstalk can be decreased to negligible values by 
simply using orthogonal routing on adjacent layers. This technique is 
now widely used during the routing of the conductors. Therefore, 
intralayer crosstalk is usually more of a concern than is interlayer 
crosstalk. 

The attenuation of the conductors has not been thoroughly inves- 
tigated, but some preliminary results have shown that signal attenua- 
tion is about 0.4 dB/ft at 250 MHz. 
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Fig. 3—A simplified model of crosstalk for a pair of lossless, uniformly, and loosely 
coupled conductors terminated with matched loads. Z;, v, and /, denote characteristic 
impedance, propagation velocity, and coupled length, respectively. (a) Capacitive 
crosstalk, V,, resulting from the mutual capacitance per unit length, C,,. (b) Inductive 
crosstalk, V;, resulting from the mutual inductance per unit length, Ln. 


Table II shows that the cp styles in the BELLPAC family of cps* 
provide a wide variety of pulse transmission properties that can satisfy 
the cp needs of most presentday Bell System projects. Many current 
projects (e.g., AMARC, PDT2A, PLAID, Triport, Ess Ring and Tone, DIF) 
make use of the double-sided (epoxy) style. In fact, this is the most 
common cP style. The double-sided metal cP is used in customer 
equipment and is now under consideration for power supply applica- 
tions. The 4L MLB and 6L MLB (ExT P/G) were used in some switching 
applications such as the PROCON project. The higher capability MLBs, 
the 6L MLBs (INT P/G with and without surface routing), were used in 
the 1A ESS processor, and are expected to find use in projects such as 
the 3B Ess processor and DIF. 


lV. THEORETICAL CROSSTALK RESULTS 
4.1 Derivation of basic crosstalk equations 


Consider the simplified model of crosstalk presented in Fig. 3. The 
capacitive crosstalk voltage denoted by V. in Fig. 3a is a suitable 
approximation when the conductors are loosely coupled. A more 


* At the present time, the bonded board (LAMPAC) and the 8L MLB (INT P/G) are not 
members of the BELLPAC family of cpr’s. 
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Double-Sided 
(metal)§ 
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Table II—Summary of pulse transmission properties of various circuit pack styles 


Characteristic 
Impedance 


(ohms) 


125 + 50 
160 + 35 
70+ 5 


150 + 20 
150 + 20 
98 + 8 
83 + 6 
95 + 10 
85 + 10 
95 + 35 
85 + 30 
75 + 30 
70 + 35 
68 + 3 
6143 
85 + 25 


75 +15 


85 + 25 
75 +15 


Propagation 
delay* 
(ns per ft) 


a! 


ee ee ei 
co ou SIO © CO 0 CO COON OT oT Gn Or Ot ow 


(surface), 1.8 


1.5 (surface), 1.8 


1. 
1.9 


* Includes the effect of 963 connector plus fanout; see Fig. 1. 
+ Maximum pulse crosstalk occurs when the round trip propagation delay over the coupled length is at least as great as the signal rise time. See 


Section 4.2. 


Rise 
time* 
(TDR) 


(ns) 
2.0 


etre 
oo 


rt : PEP NNNNNNNNNN 
2 G DHODOMMNUMAAMAMARDR w 


—_ 


eo 


Bandwidth* 
(MHz) 


250 
278 
385 


190 
190 
190 
190 
190 


278 
278 


Maximumft 
Interlayer 
Pulse 
Crosstalk 
(Near-End) 
(percent) 


0.3 


20 
24 


Maximumt 
Intralayer 
Pulse 
Crosstalk 
(Near-End) 
(percent) 


40 


30 
1.6 


18 
12 


More Details 
Shown 
in the 
Following 
Figures 
in the 
Appendix 


oO wma NN Oo oO 


11 


12 


13 


+ The first entry applies to Milene (registered trademark of W. L. Gore & Assoc., Inc.) insulation, the second to Teflon (registered trademark of 
E. I. DuPont) insulation. 
§ The first entry applies to conductor width = 8 mils, conductor space = 9 mils, the second to 12, 13 mils. 


accurate expression for V. is obtained by applying circuit theory to the 
elemental circuit in Fig. 3a. The result is that V. satisfies 
dV. x Ve _ 1dV(t— (x/v)) 
dt 2Cnr 2 at 








(2) 


or 


‘ t-t a 
1 | one OVE — (x/v)) dé. (3) 
( 


Ve== 
x/v) ag 


When the conductors are loosely coupled, C,, is relatively small, and 
eq. (3) yields the approximation given in Fig. 3a, since 


(t-8) 


y= 2m | ‘  @ en aV(E— (x/v)) 1, ZiCm AV(t — (x/v)) 
a i oe ee ay a 
(x/v) 








2 Z1C im 0& ds at (4) 


The loose coupling approximation not only allows the simplification 
of V, but also allows one to neglect the interaction of the idle conductor 
on the driven conductor. When this interaction is considered along 
with conductor losses, the analysis becomes extremely difficult (see, 
for example, Refs. 7 and 8). 

Using this simplified model, the total near-end (backward) crosstalk 
waveform, V,.-(t), and the total far-end (forward) crosstalk waveform, 
V;-(t), can be expressed as two independent differential equations: 


il Ln aV(t — (2x/v)) 
dV ne(t) = 3 (2.0. + 2) a (5) 
at Lm\ , @V(t — (le/v)) 
dV;e(t) = 5 (2.0, Z) dx. (6) 
By integrating the variable x over the coupled length /., we have that 
I 
m F ) — 
2 Zi ; ot 
= K,.{ V(t) — V(t — 2T»)], (8) 
where 
Tp = = propagation delay over the coupled length 
U Ln 
Kne = — | Z21Cn + — |. 
4 ; Zi 
Similarly, 
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1 La\ (" dV(t— Tp) 
Vee(t) —= 9 (2c. = Z) [ ee ee (9) 


dV(t — Tp) 


dt a 


= Keele 


where 


1 


Equations (8) and (10) agree with the earlier results presented in Refs. 
3, 4, 9, 10, 11, and 12. These references also contain some useful 
discussions of crosstalk associated with printed wiring interconnec- 
tions. 

Notice that, if 2iC, = (Lm/Z), Kr = 0 and V;-(t) = 0. This result 
forms the basis of the design of directional couplers and occurs quite 
naturally whenever the conductors are surrounded by a homogeneous 
medium. See Ref. 13 for a discussion of this interesting point. However, 
for all the cps considered in this paper, it turns out that V;-(t) ¥ 0. 

The simplified model presented in Fig. 3 can be generalized to 
include the case when the driven conductor has characteristic imped- 
ance Z, and propagation velocity v, while the idle conductor has 
characteristic impedance Z, and propagation velocity v2. For this case, 


(5) and (6) become: 
ave aie *) 
Lin VU) v2 


ih Lin 
Kre _ 9 (2.0, [re Z) . 


AV, e(t) = 5 (220 + 2) ax ay aa (11) 
ove Seen) See Ti 
1 1 oS ut 
dV ;-(t) = 3 (2c. - Z) a aa (12) 


By integrating the variable x over the coupled length /., we have 


oy U1U2 
vat =} (zc5-+42)(_2) 


re zs te l, ( a ) || (13) 
U,U2 


1 U U2 
o( = Z Cn —- 


bee ah 
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Equations (13) and (14) agree with the results given in Ref. 11, and 
they reduce to (8) and (10) when Z2 = Z; and v; = v2 = v. 

The corresponding results for the case when the driven conductor 
has Ze, Ue and the idle conductor has Z, v; can be obtained from eqs. 
(13) and (14) merely by interchanging Z, and Z2 and also v; and ve. In 
this manner, one can determine the following general result: 

Vnelt, 22, U2) zs Vielt, 2, U2) = La (15) 
Vnelt, Li, U;) Vyelt, Li, U)) Zi 
where 
Vie(t, Z;, vi) = near-end crosstalk waveform when the idle con- 
ductor has Z;, vi, 
and 
V(t, Z:, vi) = far-end crosstalk waveform when the idle conduc- 
tor has Z;, vj. 
Notice that eq. (15) is independent of time and the propagation 
velocities. Also, it can be shown that (15) satisfies reciprocity. 

In cP design, one usually attempts to make the characteristic imped- 
ance constant throughout the cp or Z; = Z2. However, if Z; ¥ Ze (as it 
can be when dealing with interlayer crosstalk), then eq. (15) shows 
that both near-end and far-end crosstalk are reduced when the con- 
ductor having the lower characteristic impedance is taken as the idle 
conductor. 

Although eqs. (13) and (14) are more general, experimental work has 
shown that eqs. (8) and (10) or, more generally, eqs. (13) and (14) with 
U; = U2 = U are sufficient for characterizing the crosstalk on all the cp 
styles of interest in this paper. Also, in order to help simplify the tables 
in the appendix, we shall only report on the average interlayer crosstalk 
associated with a given conductor pair. This appears to be sufficient, 
since interlayer crosstalk is usually of less concern than intralayer 
crosstalk. 

The results based on the simplified model given in Fig. 3 turn out to 
be good approximations for printed wiring boards when the value of 
Kneis less than about 25 percent. However, even when K,,. is somewhat 
greater than 25 percent, the results based on the simplified model can 
still be applied, although they become less accurate in this region. 

On all styles of cps, we have found experimentally that max | V;-(¢) | 
< max V,,(¢) for all signals and coupled lengths of interest* in this 
paper. Thus, by controlling V,,-(t), one also controls V;-(t). Accordingly, 
we have directed our experimental work toward estimating the mag- 
nitude of the near-end crosstalk and only provide experimental bounds 
on the intralayer far-end crosstalk for all the cP styles. 


* Because of connector limitations, the signal rise times of interest are 2 ns or more 
(i.e., a signal bandwidth of 175 MHz or less). The coupled lengths of interest are all less 
than 18 inches. 
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4.2 Crosstalk resulting from a pulse signal 


Let V(t) represent a ramp type of pulse signal given by: 


Vot 
as O=t=T,; 
Vit) = { Tr 
(16) 
Vo, t > Tr. 


When V, > 0, V(é) represents a rising step signal having a 100 percent 
rise time of Tr. This type of signal is convenient for characterizing the 
crosstalk resulting from the leading edge of a pulse signal. 

In practice, the total near-end crosstalk, NEXT, is usually defined as 
the fraction of the pulse drive signal that appears at the near-end of 
the idle conductor. Thus, for the pulse signal, eq. (8) yields: 


Kas 2T pd > Tr 
max Vne(t) _ 


NEXT = ——————_ 17) 
Vo Ku 22), 2Tp = Tr. ( 
R 


When 27Tp > Tr, the near-end waveform, V,,-(t), is a trapezoidal pulse 
and when 2Tp = Tr, this waveform is a triangular pulse. 

Equation (17) shows that the value of K,,. represents the maximum 
value of near-end pulse crosstalk. In the appendix, the experimental 
values of K,- and 1/v = Tp/I. for all cps of interest in this paper are 
tabulated in Tables A and B of Figs. 4 through 13. By using these 
tabulated values and eq. (17), one can readily estimate the NExT for an 
arbitrary pulse-like signal on any cP style. 

As discussed in Section 4.1, only the average values of K,. for 
interlayer, near-end crosstalk are tabulated. To estimate the two 
individual values of K,- for interlayer, near-end crosstalk, it can be 
shown that each K,- value must be multiplied by VZ/Z, and 
VZ2/Z1. The values of Z,, Z2, the characteristic impedances of the 
conductors, are also tabulated in the appendix. 

The corresponding result for the total far-end crosstalk, FEXT, can 
be obtained from eq. (10): 


max | Vielt)| _ | Kiel l 
V, Tr ; 


In this case, the far-end crosstalk waveform, V;-(f), is a rectangular 
pulse. 

For all the cp styles, we have determined experimentally that | Ky | 
<= 0.09 ns/ft for intralayer crosstalk. We shall see that this result can 
be used to bound intralayer FEXT on all the cps. 


FEXT = (18) 


4.3 Crosstalk resulting from a periodic signal - 


If V(t) represents a periodic signal of period T = 1/f,, then 
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V(t) = y ane’, (19) 


where 
1 7? 
an = z| V(t) eter dt 
-T/2 
and 
Wn = n2rfo. 


For this periodic signal, eq. (8) yields 


oO 


Vnelt) = Kne Y (Qian sin wnT pen ™? , (20) 


n=-oa 


If we now define near-end crosstalk, NEXT, as 


ac power of Vne(t) Me 
NEXT = | Se poe ee ‘ (21) 
then 
Pn 1/2 
Y | an sin wnTp|? 
NEXT = 2K,. | ———~————— = 2Kyre. (22) 
Zier 


Equation (22) shows that NEXT < 2K, for all periodic signals. 
The corresponding result for FEXT, assuming no jump discontinuities 
in V(¢), is 


foo} 


Vie(t) = Kyle > ianwneenT) (23) 
and 
ne ac power of V;-(¢) “ie (24) 
~ | ae power of V(¢) 
a 1/2 
Y | encon 
=| Ky| i | —— : (25) 
| on |” 
n=1 


By using the tabulated values of K,- and 1/v = Tp/l, given in the 
appendix together with eq. (22), one can estimate the NEXT resulting 
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from a general periodic signal on any of the cps. Also, by using the 
bound on | Ky. | given in Section 4.2 along with eq. (25), one can bound 
the intralayer FEXT on any of the cps. 

As an example of a simple periodic signal, let V(t) represent a 
sinusoid of frequency f,. Then eqs. (22) and (25) yield 


NEXT = 2Kne| sin 27f,Tp| (26) 
FEXT = | Kye | L2rf. (27) 


When f, = (47 pn)’, NEXT attains its maximum value of 2Kne, which is 
twice the maximum NEXT resulting from the pulse signal considered in 
Section 4.2. 

_ It can be shown that eqs. (26) and (27) are special cases of the more 
general results presented in the classical works on sinusoidal crosstalk 
presented in Refs. 14 and 15. These references also include the effects 
of conductor losses. In our application, the coupled length, ,, is 
relatively short (J. = 18”), so that conductor losses are negligible over 
a frequency range of about 250 MHz. 

For small values of f,.Tp(= fol-/v), eq. (26) yields 
fole 


NEXT = 47KyefoTp = 47Kne ; 





(28) 


In this case, eq. (28) shows that NEXT is proportional to both frequency, 
f., and coupled length 7, much as is FEXT. 
4.4 Crosstalk resulting from a random signal 


Let V(t) represent a differentiable, stationary random signal having 
zero mean and one-sided power spectral density W(f). The correlation 
function, p(r), of the random signal is defined by 


p(t) = EL V(t) V(t + 7)] = | W(f) cos 2nfr df, —(29) 


where EF = expectation operator. 
The correlation function, p,-(7), of the crosstalk waveform at the 
near-end of the idle conductor can be determined from eq. (8). Thus, 


Pne(T) = EL Vne(t) Vne(t + 7) ] 
= K7.[2o(t) — p(t — 2Tp) — p(t + 2Tp)]. (30) 
The power spectral density, W,.-(f), of Vne(t) is given by 


co 


W,(f) = 4 | Pne(T) cos 2afr dr (31) 


0 


= 4K?,.W(f) sin® 27f To. (32) 
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For this random signal case, it is reasonable to define the NEXT by 





NEXT = ume (33) 
oO 
where 
One = rms value of V,,-(t) 
o = rms value of V(¢). 
Thus, 
One Pne(0) 
NEXT = — = 34 
o \ p(0) oe 
1/2 
= Kul = | < 2Kynre. (35) 
oO 


Equation (35) shows that NEXT is bounded by 2K,, for all stationary 
random signals. 
The corresponding results at the far end are 





pre(t) = EL Viet) Vie(t + 7)] = —| Kye |?lép” (7) (36) 
Wrelf) = [| Kye| le(2af)]? W(f) (37) 
and eo 
FEXT = < = | Kye | le Ve (38) 
= | Kre| ka, (39) 
where 


Ofe = rms value of V;-(E) 
B = average number of zero crossings per 
second of V(t). 
Thus, for all differentiable, stationary random signals FEXT is propor- 
tional to the average number of zero crossings per second of V(t). 

By using the tabulated values of K,. and 1/v(= Tp/J.) given in the 
appendix together with eq. (35), one can estimate the NEXT resulting 
from a general random signal on any of the cps. Also, by using the 
bound on | Ky| given in Section 4.2 along with eq. (38), one can bound 
the intralayer FEXT on any of the cps. 

As an example of a random signal, let 
a B B 
B ’ fo 2 = f = fo at 


Wif) = 
0, otherwise, 


(40) 
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where 

B = bandwidth of the signal V(t) 

fo = center frequency of the signal V(t). 
For this case, eqs. (29) and (35) yield 


ae * 9 BT 1/2 
NEXT = = V2K,.|1— (= |cos 4afTp | . (41) 
o 27BTp 


Notice that, as B — 0, eq. (41) approaches eq. (26), the corresponding 
result for the sine wave case. 
The result for FEXT is 


FEXT = 2£ = | Kye| ler V4fo + B?/3. (42) 
0 


As B — 0, this result approaches eq. (27), the corresponding result for 
the sine wave case. 

The theoretical developments in Sections 4.2, 4.3, and 4.4 can be 
generalized to include eqs. (13) and (14) in place of eqs. (8) and (10). 
When v; = v2 = », all one needs to do is replace Z2iC,, by Z2Cm in Kne 
and Ky. The more general case, v; ¥ v2, will not be treated in this 
paper, since experimental results show that the propagation velocity 
is approximately constant on a given cP. 


V. SOME APPLICATIONS 
5.1 Selection of a CP style 


Since the costs associated with the various cP styles differ signifi- 
cantly,’ it is very important to select a cP style which is both suitable 
electrically and relatively inexpensive. The pulse transmission prop- 
erties summarized in Table II and tabulated in more detail in the 
appendix can be used to help select such a cost-effective cp for a given 
application. 

It is also very important that the physical designers and systems 
designers using BELLPAC hardware be aware of these basic pulse 
transmission properties. The cp transmission properties must be com- 
patible with the transmission properties of the backplane, frame wiring, 
and the cP components. 


5.2 Estimation of crosstalk on a given CP style 


By using the theoretical results presented in Section IV together 
with the appropriate K,,- and 1/v values given in the appendix, one can 
estimate the amount of near-end crosstalk for fine line conductors 
carrying a wide variety of signals on any of the cPs considered in this 
paper. As discussed in Section IV, far-end crosstalk is always less 
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than near-end crosstalk, usually much less. Also, intralayer, far-end 
crosstalk can be bounded by using the experimentally determined 
constant | Ky| = 0.09 ns/ft and the theoretical results presented in 
Section IV. 

Let us consider as an example a pair of adjacent, parallel conductors 
on the inside signal layer of the 6L MLB (INT P/G, surface routing). 
From Table B of Figure 12 in the appendix, we see that K,. = 0.16 for 
two adjacent conductors (Yi Y2) in the 200-mil channel when the 
conductor width and conductor spacing are 8 and 9 mils, respectively. 
Table A of this same figure gives 1/v = 1.8 ns/ft. Thus, for a pulse 
signal, eq. (17) yields 





0.16, 2Tp > Tr 
NEXT = 8) 
= 27 
(0.16) (=). 2Ty = Tp, 
Tr 
where 
Tp = = propagation delay over the coupled length (ns) 
1, = coupled length (ft) 
Tr = rise time of the pulse signal (ns). 
Also, for the pulse signal, eq. (18) yields 
FEXT < (0.09) de (44) 
Tr 


Similarly, for a sine wave signal of frequency f,, eqs. (26) and (27) 
yield 


NEXT = (2)(0.16) |sin 27f.T» | (45) 
FEXT = (0.09) l.27f,. (46) 


In a very similar manner, one can also estimate the NEXT and bound 
the intralayer FEXT for an arbitrary periodic or random signal by using 
eqs. (22), (25), (35), and (39). 

By using this method, one can estimate the NEXT and bound the 
FEXT for a wide variety of conductor pairs and a wide variety of signal 
types on any of the cps considered in this paper. 

For a required crosstalk constraint, the theoretical and experimental 
crosstalk results can be used to help determine routing restrictions on 
coupled length for general types of signals. Alternately, this informa- 
tion can be incorporated into computer-aided designs to help deter- 
mine whether a routed cp has violated a given crosstalk constraint 
associated with a particular signal type. In this manner, a routed cP 
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can be analyzed to detect potential crosstalk problems before the cp 
routing is finalized for manufacture. 

As a final point concerning the estimate of crosstalk, one can also 
estimate intralayer NEXT for a pair of adjacent, parallel conductors 
having a range of conductor spacings. It turns out that intralayer NEXT 
is essentially independent of conductor width (see Ref. 4). Accordingly, 
to estimate the value of K,,-(S) for a pair of adjacent, parallel conduc- 
tors (i.e., Y:Y2) having conductor spacing S, one can interpolate or 
extrapolate the values of K,. for S = 9 mils or 13 mils given in the 
appendix by assuming that K,.(S) is proportional to 1/S. It can be 
shown that this is a satisfactory assumption when 7 mils = S < 40 
mils, the region. of most interest in this paper. 

As an example, let us apply this method to a pair of adjacent parallel 
conductors having conductor spacing S (mils) on the inside signal layer 
of the 6L MLB (INT P/G, surface routing). For the 200-mil channel, 
Table B of Fig. 12 yields the following minimum mean square error 
cnnaee [13Kne(9) + 9Kne(13)] 147.9 

ne + ne . 
Kye (S) = (9) (13) (9? ana 132)S ~~ S 


This method can be applied to pairs of adjacent, parallel conductors 
on any cP considered in this paper. 


%. (47) 





5.3 Electrical comparison of the CPs 


The results in Table II and the appendix can be used to compare 
the various cps from the electrical point of view. For example, Table 
II shows that, of the three cps containing only two layers of metalli- 
zation, namely, the double-sided epoxy, the double-sided metal, and 
the bonded board, the double-sided epoxy board is inferior to the other 
two. It has a relatively high characteristic impedance and higher values 
of intralayer crosstalk. Recall that intralayer crosstalk is more trouble- 
some than interlayer crosstalk, which can be reduced considerably by 
using orthogonal routing on adjacent layers. 

Also, the double-sided metal board is somewhat better electrically 
than the bonded board because the impedance variations and crosstalk 
are less for the metal board. 

Table II also shows that the MLBs having an internal power and 
ground plane (INT P/G) are superior electrically to those having an 
external power and ground plane (ExT P/G). The MLBs having (INT 
P/G) have less impedance variations and yield less intralayer crosstalk. 

Notice from Table II that a wire-wrap cp and the double-sided 
(epoxy) CP are both inferior to the MLB styles from the electrical point 
of view. Also, the double-sided (metal) and the bonded board have 
electrical properties which are comparable to all the MLBs having (INT 
P/G). 
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Finally, the extender board, because of its special design, is clearly 
the best electrical design of all the cps considered in this paper. It has 
relatively little variation in characteristic impedance and very low 
crosstalk. 


5.4 Estimation of the capacitance and inductance of the conductors 


In certain applications of the cps, it is important to have an estimate 
of the value of C, the capacitance per unit length, and L, the inductance 
per unit length for the conductors on all of the cp styles. This 
information is important, for example, when one needs to estimate the 
electrical load on a driver circuit for certain ranges of frequency or rise 
times. The values of C and L can be estimated from the values of 
propagation delay per foot (1/v) and characteristic impedance Z; given 
in the appendix for each cp style. Using these values, C and L are 
given by 





_ (1/v) nf 

cae (48) 
pela 
L= (5 )2: m (49) 


For worst case estimates, 1/v should be increased by about 10 percent, 
since the values listed in the appendix are averages over about 20 
different conductor paths on each cp. 


5.5 Generalization to other dielectric materials 


If eqs. (48) and (49) are used in eq. (8) to reduce Kye, we have 


i one ore 
Rose | ee: 
tere (50) 


Equation (50) shows that K,,- is independent of the relative dielectric 
constant, ¢«, (effective). It can also be shown that the more general Kn. 
discussed at the end of Section 4.4 is also independent of the relative 
dielectric constant. Thus, the values of Kine given in the appendix apply 
when the cps are fabricated with any dielectric material. 

One can also show that the propagation delay per foot, 1/v, and the 
far-end crosstalk coefficient | Kye | are both proportional to Ve,, while 
the characteristic impedance, Zi, is inversely proportional to Ve,. Thus, 
many of the results in this paper can be applied when the cPs are 
fabricated with other dielectric materials such as ceramic, Teflon, or 
polyimide. 


5.6 Electrical characterization of backplanes 


In the physical design of large electronic systems, various styles of 
printed wiring-board backplanes are often used to interconnect CPs. 


1012 THE BELL SYSTEM TECHNICAL JOURNAL, MAY-JUNE 1979 


These backplanes are usually very similar to some cP styles considered 
in this paper. Accordingly, many results in this paper can be applied 
to help electrically characterize various styles of backplanes. 


VI. SUMMARY 


A Bell System packaging effort (BELLPAC packaging system) is 
now under way to develop a modular packaging system for packaging 
electronic equipment. This effort makes use of a suitable connector 
(963) and a number of circuit pack (cp) styles (ranging from wire-wrap 
cps to multilayer board cps) which have common features suitable for 
computer-aided design. 

This paper presents some experimental results concerning the pulse 
transmission properties of fine line printed conductors (e.g., width = 8 
mils, spaces = 9 mils) on various styles of cPs which include those in 
the BELLPAC hardware family of cps. The pulse transmission prop- 
erties include the characteristic impedance, the propagation delay, the 
rise time, the bandwidth, and the intralayer and interlayer pulse 
crosstalk. Theoretical scaling laws are developed to extend the appli- 
cation of the experimental crosstalk results to conductor spaces in the 
range of 7 to 40 mils. 

A simplified theoretical model is presented which leads, directly, to 
some basic crosstalk equations. Also, theoretical results are developed 
to extend the application of the experimental crosstalk results to 
arbitrary pulse signals, periodic signals, and random signals. 

The results in this paper can be applied to the: 

(t) Selection of a cP style for a given application. 

(ti) Estimation of crosstalk on a given cP style. 

(zz) Comparison of the electrical properties of the cp styles. 

(iv) Estimation of the capacitance and inductance of the conduc- 

tors. 

(v) Determination of the pulse transmission properties of the cP 

styles with various dielectrics. 

(vz) Electrical characterization of various styles of backplanes. 

The crosstalk results are very important since they tend to limit the 
packaging density of printed conductors on the cP styles by limiting 
the coupled length and spacing of parallel conductors. 
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APPENDIX 


Experimental Results 
Index to pulse transmission properties 


Figure No. Containing 
the Pulse Transmission 
Circuit Pack Style Properties 


Wire wrap 4 
Extender board 5 
Double-sided (epoxy) 6 
Double-sided (metal) 7 
Bonded board (LAMPAC) 8 
4L MLB (ExT P/G) 9 


6L MLB (ExT P/G) 10 
6L MLB (INT P/G) 11 
6L MLB (INT P/G, surface routing) 12 
8L MLB (INT P/G) 13 
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7 


LAYUP FOR THE WIRE WRAP BOARD 


TABLEA 


PULSE TRANSMISSION PROPERTIES OF THE 
WIRE WRAP BOARD 


AWG 30 WITH MILENE INSULATION AWG 30 WITH TEFLON INSULATION 


LOCATION CHARACTERISTIC LOCATION CHARACTERISTIC 
OF WRAP IMPEDANCE OF WRAP IMPEDANCE 
ON PIN 


200 mil 


100 mil 
CHANNELS BOTTOM = BOTTOM 


@ PROPAGATION DELAY = 1.4 ns/ft. (=1/v, MILENE), 1.3 ns/ft. (= 1/v, TEFLON). 
@ 80% RISE TIME ON TDR FOR 1 ft CONDUCTOR LENGTH = 2.0 ns (MILENE) 1.8 ns (TEFLON). 
@ BANDWIDTH FOR 1 ft CONDUCTOR LENGTH = 250 MHz (MILENE), 278 MHz (TEFLON). 





(1) THE 200 mil CHANNEL ALSO CONTAINS A GROUND PLANE. 


TABLE B 
Kye = MAXIMUM NEAR-END PULSE CROSSTALK") FOR 
VARIOUS WIRE PAIRS ON THE 
WIRE WRAP BOARD 


AWG 30 WITH MILENE INSULATION AWG 30 WITH TEFLON INSULATION 


LOCATION CROSSTALK LOCATION CROSSTALK 
OF WRAP (TIGHTLY COUPLED | OF WRAP (TIGHTLY COUPLED 
ON PIN PAIRS) ON PIN PAIRS) 


200 mil 
100 mil 


(1) CROSSTALK WAS MEASURED AS A PERCENTAGE OF THE INPUT STEP. 





(2) THE 200 mil CHANNEL ALSO CONTAINS A GROUND PLANE. 


Fig. 4—Impedance, propagation delay, rise time, bandwidth, and pulse crosstalk for 
the wire-wrap board with 963C-100 connectors. 
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12 __ 
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S2 
l 12 B-STAGE 
PAD 2 


@ 2 0Z CU CONDUCTORS @ALL DIMENSIONS IN MILS 


{a} NOTATION FOR INTERLAYER CROSSTALK ON THE 
EXTENDER BOARD. 





(b) NOTATION FOR INTRALAYER CROSSTALK ON THE 
EXTENDER BOARD. 


ELECTRICAL CHARACTERISTICS OF LARGEST (7.67” X 14.78”) 
EXTENDER BOARD (WITH 963C CONNECTORS) 


@ CHARACTERISTIC IMPEDANCE = 70 + 5OHMS. 
@ PROPAGATION DELAY = 1.80 ns/ft. 

@ 80 % RISE TIME ON TDR = 1.3 ns. 

@ BANDWIDTH = 385 MHz. 

@ MAXIMUM INTRALAYER NEXT: Y, Yo = 1.6 % 


@ MAXIMUM INTERLAYER NEXT: S, Sy = 0.3 % 


Fig. 5—Impedance, propagation delay, rise time, bandwidth, and pulse crosstalk for 
the extender board with 963C-100 connectors. 
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@20Z CONDUCTORS @ALL DIMENSIONS IN MILS 
{a) NOTATION FOR INTERLAYER CROSSTALK ON THE 
DOUBLE SIDED EPOXY PWB. 


5 LAYER P/S, OR G/S. 





(b) NOTATION FOR INTRALAYER CROSSTALK. 
W= 12,S= 13 MILS OR W=8,S=9 MILS. 


TABLEA 


PULSE TRANSMISSION PROPERTIES OF THE 
DOUBLE-SIDED EPOXY pwB'3 


) 
CONDUCTOR WIDTH = 12 mils CONDUCTOR WIDTH = 8 mils 
CONDUCTOR SPACING = 13 mils |} CONDUCTOR SPACING = 9 mils 
sou Paeene*| am [Ameen 
LAYER LAYER 


ROUTING IN 163 OHMS 158 OHMS 


200 mil 


ROUTING IN [a 


CHANNELS 


| 1s8.0HMS | 
| tes 
top mit | __GS2_| eo S| 
fa 


@ PROPAGATION DELAY = 1.5 ns/ft. (=1/v). 
© 80% RISE TIME ON TOR FOR 1ft CONDUCTOR LENGTH = 2.6 ns. 
© BANDWIDTH FOR 1ft CONDUCTOR LENGTH = 190 MHz. 


(1) THE TRANSITIONS FROM P/S, TO G/S. USED 10 VIAS. 


(2) THE P/G ROUTING ALSO OCCUPIES THE 200 mil CHANNELS. 

(3) NO COVERCOAT CASE. COVERCOAT (GFR) HAS NEGLIGIBLE EFFECT ON 
RISE TIME (OR BANDWIDTH) BUT DECREASES THE CHARACTERISTIC 
{MPEDANCE BY ABOUT 5% AND INCREASES THE PROPAGATION DELAY 
BY ABOUT 10%, 








LAYER 
GS. 





‘LAYER P/S, 








WHICH APPEAR ALTERNATELY ON LAYERS P/S, AND G/S>. 
W= 12,S = 13 MILS OR W=8,S=9 MILS. 


TABLE B 
Kye = MAXIMUM NEAR-END PULSE CROSSTALK'') FOR 
VARIOUS CONDUCTOR PAIRS ON THE 
DOUBLE-SIDED EPOXY PwB) 


CONDUCTOR WIDTH = 12 mils, CONDUCTOR SPACING = 13 mils 


= INTERLAYER INTRALAYER ALTERNATELY ON P/S,, 
LOCATION OF LAYER Y, Y, 
ROUTING 
200 34% | 12% | 8.0% 
mil 24% 
CHANNEL(2) GS, 


ets 
ie id 
pees eal 


Ma MMMMMAAAY 


CONDUCTOR WIDTH = 8 mils, CONDUCTOR SPACING = 9 mils 


| INTERLAYER INTRALAYER ALTERNATELY ON P/S,, G/S, 


LOCATION OF LAYER |Y, Yo|Y, Y/Y xXx 


200 
mil 
CHANNEL 





CHANNEL 
(1) CROSSTALK WAS MEASURED AS A PERCENTAGE OF THE SIGNAL STEP, 
(2) THE P/G ROUTING ALSO OCCUPIES THE 200 mit CHANNELS. 


(3) NO COVERCOAT CASE, COVERCOAT (GFR) INCREASES CROSSTALK BY 
ABOUT 10% OF THE TABULATED VALUES. 


Fig. 6—Impedance, propagation delay, rise time, bandwidth, and pulse crosstalk for the double-sided 
epoxy printed wiring board with 963C-100 connectors. 
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(a) NOTATION FOR INTERLAYER CROSSTALK ON THE 
DOUBLE SIDED METAL PWB. 


Ss + LAYER P/S, OR G/S 








(b) NOTATION FOR INTRALAYER CROSSTALK. 
W= 12,S= 13 MILS OR W=8,S=9 MILS. 


TABLE A 
PULSE TRANSMISSION PROPERTIES OF THE 
DOUBLE-SIDED METAL Pws!?) 


CONDUCTOR WIDTH = 12 mils | CONDUCTOR WIDTH = 8 mils 
CONDUCTOR SPACING = 13 mils | CONDUCTOR SPACING = 9 mils 
CHARACTERISTI 
aan STIC] swap. | CHARACTERISTIC 
pee IMPEDANCE IMPEDANCE 
(OHMS) LAYER (OHMS) 


[?5, | _esonus_|_PS | 104 Wms 


a eT 

ES CT 

a 
ee 


ws; 
Fa OR, Te =O] so | 

®@ PROPAGATION DELAY = 1.5 ns/ft. (=1/v). 

@ 80% RISE TIME ON TOR FOR 1 ft. CONDUCTOR LENGTH = 2.6 ns. 

© BANDWIDTH FOR 1 ft. CONDUCTOR LENGTH = 190 MHz. 


(1) THE TRANSITIONS FROM P/S, TO G/S. USED 10 VIAS. 


(2) THE P/G ROUTING ALSO OCCUPIES THE 200 mil CHANNELS. 

(3) NO COVERCOAT CASE. COVERCOAT (GFR) HAS NEGLIGIBLE EFFECT ON RISE 
TIME (OR BANDWIDTH) BUT DECREASES THE CHARACTERISTIC IMPEDANCE 
BY ABOUT 5% AND INCREASES THE PROPAGATION DELAY BY ABOUT 10%. 





LAYER 
LAYER P/S,! 
Ss im i G/S 


=: 
X4 


(c) NOTATION FOR CROSSTALK BETWEEN CONDUCTORS 
WHICH APPEAR ALTERNATELY ON LAYERS P/S, AND G/Sp. 
W= 12,S= 13 MILS OR W=8,S=9 MILS. 


TABLE B 
Kye = MAXIMUM NEAR-END PULSE CROSSTALK") FOR 
VARIOUS CONDUCTOR PAIRS ON THE 
DOUBLE-SIDED METAL PwB'?) 


CONDUCTOR WIDTH = 12 mils, CONDUCTOR SPACING = 13 mils 


| TERLAYER INTRALAYER _ JALTERNATELY ON P/S,, G/S. 


LOCATION OF 
200 8 
mil . g.2% | 24% | 6.8% 
CHANNEL(2) | 08 | 
mil 
pe oe 


'LAYER PIS, 
'- X, 
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CONDUCTOR WIDTH = 8 mils, CONDUCTOR SPACING = 9 mils 


INTERLAYER INTRALAYER ALTERNATELY ON P/S,, G/S 


LOCATION OF 
200 15% | 3.6% | 1.2% 
mil 1.2% 11% 3.2% 8.0% 
CHANNEL LS) 


Pua e ] 
| Lr | 
CHANNEL es, | 13 | — | —| 


(1) CROSSTALK WAS MEASURED AS A PERCENTAGE OF THE SIGNAL STEP. 
(2) THE P/G ROUTING ALSO OCCUPIES THE 200 mil CHANNELS. 


{3) NO COVERCOAT CASE. COVERCOAT (GFR) INCREASES CROSSTALK BY ABOUT 
10% OF THE TABULATED VALUES. 


Fig. 7—Impedance, propagation delay, rise time, bandwidth, and pulse crosstalk for the double-sided 


metal printed wiring board with 963C-100 connectors. 
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(a) NOTATION FOR INTERLAYER CROSSTALK ON THE 
BONDED BOARD. 


s LAYER P/S, OR G/Sy 





(b) NOTATION FOR INTRALAYER CROSSTALK. 
W= 12,$= 13 MILS OR W= 8,S = 9 MILS. 


TABLE A 
PULSE TRANSMISSION PROPERTIES OF THE 
BONDED BOARD’) 


CONDUCTOR WIDTH = 12 mils | CONDUCTOR WIDTH = 8 mils 
CONDUCTOR SPACING = 13 mils | CONDUCTOR SPACING = 9 mils 
CHARACTERISTIC CHARACTERISTIC 
IMPEDANCE SIGNAL | IMPEDANCE 
(OHMS) LAYER 


100 mil 


CHAN [2 [ia 


@ PROPAGATION DELAY = 1.5 ns/ft. 7 7 
@ 80% RISE TIME ON TDR FOR 1 ft. CONDUCTOR LENGTH = 2.6 ns. 
@ BANDWIDTH FOR 1 ft. CONDUCTOR LENGTH = 190 MHz. 


ROUTING IN 





(1) THE TRANSITIONS FROM P/S, TO G/S. USED 10 VIAS. 
{2) THE P/G ROUTING ALSO OCCUPIES THE 200 mil CHANNELS. 


{3) NO COVERCOAT CASE. COVERCOAT (GFR) HAS NEGLIGIBLE EFFECT ON RISE 
TIME (OR BANDWIDTH) BUT DECREASES THE CHARACTERISTIC IMPEDANCE 
BY ABOUT 5% AND INCREASES THE PROPAGATION DELAY BY ABOUT 10%. 
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(1) CROSSTALK WAS MEASURED AS A PERCENTAGE OF THE SIGNAL STEP. 
(2) THE P/G ROUTING ALSO OCCUPIES THE 200 mil CHANNELS. 


(3) NO COVERCOAT CASE. COVERCOAT (GFR) INCREASES INTRALAYER 
CROSSTALK BY ABOUT 10% OF THE TABULATED VALUES. 


Fig. 8—Impedance, propagation delay, rise time, bandwidth, and pulse crosstalk for the bonded 


board with 963C-100 connectors. 
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(b) NOTATION FOR INTRALAYER CROSSTALK. 
W = 12,S = 13 MILS OR W = 8,S =9 MILS. 
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(1) THE TRANSITIONS FROM S, TOS. USED 10 VIAS. 
(2) THE EXTERNAL P/G ROUTING ALSO OCCUPIES THE 200 mil CHANNELS. 
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(1) CROSSTALK WAS MEASURED AS A PERCENTAGE OF THE SIGNAL STEP. 
(2) THE EXTERNAL P/G ROUTING ALSO OCCUPIES THE 200 mil CHANNELS. 


Fig. 9—Impedance, propagation delay, rise time, bandwidth, and pulse crosstalk for the 4L MLB (ExT P/G) with 


963C-100 connectors. 


SGYVOd DONIYIM G3ALNiYd 


LZOL 


F107 = S, W=8,S=9MILS, 
! 15 C-STAGE 
(a) NOTATION FOR INTERLAYER CROSSTALK _! ¥ fs ccsrace So 
ON THE GL MLB (EXT P/G). eek lll? B-STAGE TABLE B 

3 

15 C-STAGE Kye = MAXIMUM NEAR-END PULSE CROSSTALK"! FOR 
4 VARIOUS CONDUCTOR PAIRS ON THE 

Yh mamta 6L MLB (EXT P/G) 


WwW 
(b) NOTATION FOR INTRALAYER CROSSTALK. i 
W= 12,8 = 13 MILS OR W=8,S=9 MILS 


1 1 
(c) NOTATION FOR CROSSTALK BETWEEN fy LAYERS), LAYERS, , LAYERS, 
CONDUCTORS WHICH APPEAR ALTERNATELY =e 
Ww 







mz ON LAYERS S, AND S,. W= 12,S= 13 MILS OR 
B-STAGE 












oe WIDTH = eee mils, CONDUCTOR SPACING = 13 mils 
@ 20Z CU CONDUCTORS @ ALL DIMENSIONS IN MILS 









INTERLAYER | INTRALAYER | ALTERNATELY ONS, ,S, 


— OF 
$)S, | S9$ 


| 1% | 
TABLE A mil 35% | 5.0% 16% 1.2% 8.0% 
CHANNEL?) psi ne 
PULSE TRANSMISSION PROPERTIES OF THE 
6L MLB (EXT P/G) | Sy | 6 | 





LAYERS, 
































CONDUCTOR WIDTH = 12 mils CONDUCTOR WIDTH = 8 mils 100 al 
CONDUCTOR SPACING = 13 mils | CONDUCTOR SPACING = 9 mils mil Eso 
CHARACTERISTIC CHANNEL + tat 


CHARACTERISTIC 
SIGNAL 
LAYER IMPEDANCE 













IMPEDANCE 
SSS CQ iw 
CONDUCTOR WIDTH = 8 SSS CONDUCTOR SSS =9 mils 









ROUTING IN 
200 mil 
CHANNELS'2) 


| | _INTERLAYER INTRALAYER | ALTERNATELY ONS, ,Sq 


LOCATION OF 
100 mit 


a te] 
a 169 13% 14% 
[95 | mil 8.0% | 30% | 5.0% * _ : 

; 8s CHANNEL = 


cy, 7 ea] 

7 er 

@ PROPAGATION DELAY = 1.8 ns/ft. (=1/v). mil 28 3s, |e 
@ 80% RISE TIME ON TOR FOR 1 ft. CONDUCTOR LENGTH = 2.5 ns. CHANNEL 3 

@ BANDWIDTH FOR 1 ft. CONDUCTOR LENGTH = 200 MHz. | Ss { 32 | 


(1) THE TRANSITIONS FROM S, TOS, USED 5 OR 10 VIAS. (1) CROSSTALK WAS MEASURED AS A PERCENTAGE OF THE SIGNAL STEP. 
(2) THE EXTERNAL P/G ROUTING ALSO OCCUPIES THE 200 mil CHANNELS. (2) THE EXTERNAL P/G ROUTING ALSO OCCUPIES THE 200 mil CHANNELS. 
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Fig. 10—Impedance, propagation delay, rise time, bandwidth, and pulse crosstalk for the 6L MLB (ExT P/ 
G) with 963C-100 connectors. 


6Z6L ANN-AVW “IWNYNOP TIWOINHOSZL WALSAS 1138 SHL 2@2z0b 


D1 


~B-STAGE 





C-STAGE 
2 
1 STAGE 
_ == PAD 2 


@20ZCUCONDUCTORS @ ALL DIMENSIONS IN MILS 


(a) NOTATION FOR INTERLAYER CROSSTALK ON THE 
6L MLB (INT P/G). 


Se LAYER S; 





{b) NOTATION FOR INTRALAYER CROSSTALK 
W= 12, = 13 MILS OR W=8,S=9MILS. 


TABLE A 
PULSE TRANSMISSION PROPERTIES OF THE 
GL MLB (INT P/G) 


CONDUCTOR WIDTH = 12 mils CONDUCTOR WIDTH = 8 mils 
CONDUCTOR SPACING = 13 mils | CONDUCTOR SPACING = 9 mils 


SIGNAL CHARACTERISTIC SIGNAL CHARACTERISTIC 
LAYER IMPEDANCE LAYER IMPEDANCE 
(OHMS) 
ROUTING IN Se ao 72 OHMS 
ee ee 
rourmoin{ $1 |! TS 
ct A 
as Ds ee ed 
@ PROPAGATION DELAY = 1.9 ns/ft. (= 1/v). 


© 80% RISE TIME ON TDR FOR 1 ft. CONDUCTOR LENGTH = 1.8 ns. 
@ BANDWIDTH FOR 1 ft. CONDUCTOR LENGTH = 278 MHz. 
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(1) CROSSTALK WAS MEASURED AS A PERCENTAGE OF THE SIGNAL STEP. 
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Fig. 11—Impedance, propagation delay, rise time, bandwidth, and pulse crosstalk for the 6L MLB (INT P/G) with 


963C-100 connectors. 
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(1) THE TRANSITIONS FROM S, TO S, USED 10 VIAS. (1) CROSSTALK WAS MEASURED AS A PERCENTAGE OF THE SIGNAL STEP. 
(2) NO COVERCOAT CASE. ON LAYERS S,, Sq, COVERCOAT (GFR) HAS NEGLIGIBLE (2) NO COVERCOAT CASE. ON LAYERSS.. $,. COVERCOAT (GFR) INCREASES 
EFFECT ON RISE TIME (OR BANOWIDTH) BUT DECREASES THE CHARACTERISTIC INTHALAVER GROSTHALICRY ALGIIT du GE-THE TAGUURTER VALUES: 


IMPEDANCE BY ABOUT 10% AND INCREASES THE PROPAGATION DELAY 
BY ABOUT 10%. 


Fig. 12—Impedance, propagation delay, rise time, bandwidth, and pulse crosstalk for the 6L MLB 
(int P/G, surface routing) with 963C-100 connectors. 
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(1) THE TRANSITIONS FROM S. TO S3 USED 10 VIAS. (1) CROSSTALK WAS MEASURED AS A PERCENTAGE OF THE SIGNAL STEP. 





Fig. 13—Impedance, propagation delay, rise time, bandwidth, and pulse crosstalk for the 8L MLB (INT P/ 
G) with 963C-100 connectors. 
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High Resolution, Steep Profile, Resist Patterns 


By J. M. MORAN and D. MAYDAN 
(Manuscript received September 26, 1978) 


High resolution and steep profile patterns have been generated in 
a 2.6-um thick organic layer which conforms to the steps on a wafer 
surface and is planar on its top. This thick organic layer (a photore- 
sist in the present experiments) is covered with an intermediate layer 
of SiOz and a top, thin layer of X-ray or photoresist. After exposure 
and development of the top resist layer, the intermediate layer is 
etched by CHF; reactive ion etching. The thick organic layer is then 
etched by Oz reactive ion etching. Submicron resolution with essen- 
tially vertical walls in the thick organic material was achieved. The 
technique is also applicable to photo- and electron lithography. It 
reduces the need for thick resist patterns for the lithography step and, 
at the same time, ensures high resolution combined with good step 
coverage. 


1. INTRODUCTION 


One of the more difficult problems with resist pattern generation is 
to achieve good linewidth control, high resolution, and good step 
coverage simultaneously. Often the requirements appear to be mu- 
tually exclusive; good step coverage requires thick resist; high resolu- 
tion, however, is more easily obtained in thin resist. This is true for all 
resists, both positive and negative. 

With any resist, the ideal conditions to obtain high resolution and 
good linewidth control are a flat surface and a thin resist (3000-4000 
A°). The flat surface means that the resist has very little variation in 
thickness and that, as a result, there will be little variation in resist 
line width. However, such resist line width variations will occur when 
lines traverse a step. As device wafers do have steps, thick resist (7000- 
15000 A) must be applied to achieve coverage over steps. 

We discuss here a method for generating high resolution, steep 
profile resist patterns by first preparing a flatter surface on the wafer.’ 
This is done by applying a layer of thick organic material that conforms 
with its lower surface to the wafer and is planar on its top. The thick 
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Table I—High resolution, steep profile, resist patterns 
Advantages 


. Planar surface for resist patterning 

. Excellent step coverage 

Good linewidth control 

. Thinner resist can be used for better resolution 

. Eliminates standing waves and scattering in photolithography 
. Reduces proximity effects in electron lithography 

. Minimal resist erosion during substrate etch by plasma or ions 


NOOO oh 


Disadvantage 


Requires extra processing steps 


4000 A PHOT 
REACTIVE R.F. SPUTTER ETCH-CHF3 EEECERON. a 


1200 A X—RAY RESIST 


a A el 
















\ 2-3yum 

THICK 

VLSI SUBSTRATE RESIST 

REACTIVE R.F. SPUTTER ETCH-O, 5105 

dey pda 
Oe 

RESIST 
-—S1,02 


VLSI SUBSTRATE 


eis 1—Schematic presentation of the various steps required to define a steep resist 
profile. 


layer is then patterned using an intermediate masking layer and a thin 
top layer of X-ray resist. The result is that as much as 2.6 wm of 
plasma-resistant organic material can be patterned with better than 1- 
um resolution and steep sidewalls comparable with those in positive 
photoresist. The advantages and disadvantages of this technique are 
outlined in Table I. 


il. EXPERIMENT 


A 2.6-um thick layer of photoresist,* serving as the thick organic 
layer, was spun on a silicon wafer. The intermediate layer of 0.12 um 


* HPR-204, manufactured by Hunt Chemical Co. 
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of silicon dioxide was plasma-deposited at 200°C on the photoresist, 
and then a 1.0-um thick layer of chlorine-based negative X-ray resist 
was deposited on top of the oxide. Figure 1 is a schematic presentation 
of the processing sequence. 

The top layer of X-ray resist was exposed and developed to a final 
thickness of 0.45 um using an X-ray exposure tool.”* With the X-ray 
resist as a mask, the SiO» was either plasma- or reactive-ion-etched 
with a CHF; gas. The pattern was then transferred into the thick 
organic (resist) layer using reactive RF sputter etching, with pure O» 
gas forming the plasma and the SiO: acting as the mask. The RF power 
density was 0.50 watt/cm? and the time required to etch the resist was 





Fig. 2—Pattern etching into 2.6-um thick resist layer using reactive RF sputter etching. 
Trench width is 1.5 pm. 
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20 minutes. Figure 2 shows the resultant pattern, which is 2.6 um high 
and has a trench width of 1.5 ym. The photos were taken with a 
scanning electron microscope at a very steep angle to clearly show the 
wall structure of the resist. Note that the walls are perpendicular and 
there is very little undercut. The oxide is still on top of the organic 
(resist) layer, and its thickness loss during the sputter etch was less 
than 0.02 um. 

Another method of etching the thick organic layer is with plasma 
etching using a radial flow machine. This method, however, produced 
patterns with some undercut and had a resist etch uniformity variation 
across a 3-in. wafer of 50 percent, which, coupled with the undercut, 
would give poor linewidth control. 


1.0um H 





Fig. 3—sEm photograph of texture surrounding all the etched features. The fibers of 
this texture are aluminum oxide and are submicron in diameter. 
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One of the more serious problems encountered with RF reactive 
etching was the presence of texture after completion of the etch (see 
Fig. 3). This texture,*” as determined by Auger analysis, consisted of 
aluminum oxide fibers that were resistant to further plasma processing. 
Their formation was due to the presence of aluminum in the active 
plasma area. In fact, the presence of any metals (e.g., copper, titanium, 
tantalum, etc.) also produced this texture. 

Solution to the texture problem involved the construction of a 
chamber having no exposed metallic surfaces. Figure 4 shows a pattern 
etched under these conditions. The thick organic layer in this case is 


1.0um H 





Fig. 4—Pattern sputter-etched using a system having no exposed metallic surfaces. 
Etching has caused no fiber formation. 
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ED) 





Fig. 5a—1-ym lines and spaces on top of phosphosilicate glass steps. Note uniformity 
in line width. 


2.0 um of Novolac-based positive photoresist. Again, note the straight 
walls and high resolution. 

Although this technique dramatically demonstrates high resolution 
capabilities in thick resists, its most useful application is in patterning 
over steps. Figure 5a shows 1-ym lines with 1-um spacings going over 
1-um high steps in flowed phosphosilicate glass. There is no measurable 
variation in linewidth as the 1-um feature traverses a step. Figure 5b 
is a profile sEM photo of the same features showing the conforming 
lower surface and the planar top which make the high resolution 
possible. 

To further demonstrate the high resolution capabilities of this type 
of system, Fig. 6 shows that a 0.5-um window can be patterned in the 
thick resist. Figure 7 shows a 0.45-um trench that was made by using 
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Fig. 5b—Profile of Fig. 5a showing step coverage and flat top surface. Thick resist 
varies from 2.8-um high in valley to 2.1 um on top-of a step. Resist width is 1 pm. 


a 0.3-um thick positive photoresist for the top layer. This feature was 
exposed in the photoresist using a projection photomask aligner. 


lll. DISCUSSION 


The three-layer technique to obtain high resolution features need 
not be applied solely to X-ray lithography. This technique has various 
advantages for other lithographic systems as well. 

For optical lithography, the thick underlying layer of resist and the 
layer of SiOz are sufficient to reduce the reflection from the wafer 
surface and, as a result, reduce standing-wave problems. The flat 
surface of the thick organic layer will keep scattering down and the 
top resist can be made thin for high resolution. 
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Fig. 6—A 0.5-um trench in 2.3-um thick resist. 


For electron beam lithography, backscattering from a substrate 
covered by 0.1 pm SiOz on top of 2 um to 3 ym of polymer should be 
less than that from an Si or SiO2/Si substrate. Thus, proximity effects 
existing in that technology might be reduced and better linewidth 
control can be expected. In addition, high-speed negative resists, such 
as COP, might be considered for patterning since the top layer of resist 
need not be thick. 

Although a three-layer technique (i.e., top radiation sensitive resist, 
SiOz, bottom organic (resist) layer) has been discussed here, a two- 
level scheme®’ without SiO» also works if the top patterned resist is 
very resistant to the reactive RF sputter etch. In addition, there are 
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Fig. 7—A 0.45-um trench RF etched into a 2.0-um thick layer of organic material 
(resist). Initial patterning was done with projection photolithography. 


many choices for the intermediate and the thick organic layers. The 
intermediate layer could be silicon nitride, boron nitride, or some other 
suitable low-defect material. For the bottom layer, many positive and 
negative resists familiar to photo and electron lithography may be 
used, as well as other polymers. 


IV. CONCLUSION 


The steep profile three-layer pattern generation technique presents 
many potential advantages and applications to all phases of lithogra- 
phy. Submicron resolution features with better linewidth control and 
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good step coverage are possible. The problem regarding the deposition 
of high-quality and low-defect density material for the intermediate 
layer remains to be evaluated. 
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A New Selective Fading Model: Application to 
Propagation Data 
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Channel transmission models for use in estimating the performance 
of radio systems on line-of-sight paths at 6 GHz are explored. The 
basis for this study is the simple three-ray multipath fade, which 
provides a channel transfer function of the form H(w) = a[1 — b exp 
— J(w — wo)t], where a is the scale parameter, b is a shape parameter, 
7 is the delay difference in the channel, and «a is the (radian) 
frequency of the fade minimum. This model is indistinguishable from 
an ideal channel model, within the accuracy of existing measure- 
ments. The propagation data that confirm the model were obtained 
in summer 1977 from a 26.4-mile hop near Atlanta, Georgia. The 
received power at 24 sample frequencies spaced at 1.1 MHz and 
centered on 6034.2 MHz was continuously monitored and recorded 
during periods of anomalous behavior. The model is applied to 
estimating the statistics of the channel delay difference, tr. The aver- 
age delay difference giving rise to significant selectivity in the channel 
is between 5 and 9 ns. The distribution of delay difference is obtained 
for delay differences greater than 10 ns. The channel is found to have 
more than 3 dB of selectivity (difference between maximum and 
minimum attenuation in band) due to delay differences greater than 
20 ns for more than 70 seconds in a heavy fading month. (This is 
comparable to the time the channel attenuation of a single frequency 
exceeds 40 dB.) The three-path model requires further simplification 
for narrowband channel application. For a channel with 30 MHz 
bandwidth, a model with fixed delay of 6.3 ns provides a sufficiently 
accurate representation of all observed channel conditions. The re- 
sulting nonphysical model is used to statistically characterize the 
condition of the fading channel. The statistics of the parameters of 
the fixed delay model are almost independent and of relatively simple 
form. The distribution of the shape parameter b is of the form (1 — 
b)**. The distribution of a is lognormal. For b > 0.5, the mean and 
standard deviation of —20 (log a) are 25 and 5 dB, respectively; the 
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mean decreases to 15 dB for smaller values of 6. The probability 
density function of wo is uniform at two levels; measuring wo from the 
center of the band, the magnitude of wot is five times as likely to be 
less than m/2 than to be greater. A companion paper describes the 
use of this model for determining the bit error rate statistics of a 
digital radio system on the modeled path. 


1. INTRODUCTION 


Performance prediction of a digital radio system on a line-of-sight 
microwave channel requires an accurate statistical model of the chan- 
nel. Because different digital radio systems may have different sensi- 
tivities to the various channel impairments, the model must be com- 
plete to the extent that it must be capable of duplicating the amplitude 
and phase (at least approximately) of all observed channel conditions. 
To facilitate laboratory measurements and computer simulations for 
calculating outage, the model should be realizable as a practical test 
circuit and should have as few parameters as possible. Most important, 
the parameters should be statistically well behaved. 

Two types of models have been generally considered for line-of-sight 
microwave radio channels: power series type models’ and multipath 
models.*® A power series model will require a few terms only if the 
channel is a multipath medium with a small spread of delays relative 
to the reciprocal bandwidth of the channel.? This implies that one 
must understand the channel as a multipath medium to understand 
the behavior of a power series model. Hence, we have limited our 
characterization efforts to multipath models. 

The basis for this study is the simple three-ray multipath fade.’ If 
the fading in a channel can be characterized by a simple three-path 
model, the channel will (as shown in Section II) have a voltage transfer 
function of the form 


H(w) = a{1 — ber], (1) 


where the real positive parameters a and b control the scale and shape 
of the fade, respectively, t is the delay difference in the channel, and 
wo is the radian frequency of the fade minimum. The plus and minus 
signs in the exponent correspond, respectively, to the channel being in 
a nonminimum phase or minimum phase state. Note that, with appro- 
priate choices of parameters, this model can be reduced to a two-path 
model or a scaled two-path model, etc. 

It has been shown previously,’ and is illustrated in Section II, that 
the simple three-path fade overspecifies the channel transfer function 
if the delay is less than % B, where B is the observation bandwidth. 
The critical value of + for a 30-MHz channel is about 5.5 ns, which is 
comparable to the mean delay in the channel. As a consequence, unless 
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the channel response can be determined to an accuracy on the order 
of 0.001 dB, a unique set of parameters a, b, r, and f, cannot be 
determined for more than half the faded channel conditions encoun- 
tered. To avoid this problem, one must suppress or fix one of the 
model parameters. Section II shows that the delay, 7, is the only 
parameter which, when fixed, produces a reasonable model. 

While a model with a fixed delay may appear to be a strange choice, 
it has all the required characteristics for modeling the channel transfer 
function. Figure 1 shows the amplitude of the channel transfer function 
of eq. (1) on a power scale and on a decibel scale for r = 6.31 ns. With 
7 fixed, the response minimum is shifted with respect to frequency by 
varying f,. Varying a changes the overall level and b changes the 
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Fig. 1—Channel model function. H(w) = a[1 — 6 exp(—j 2 7(f— f.)7)], for 7 = 6.3 ns, 
a= 0.1, 6 = 0.7. 
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“shapeliness.” If the minimum is within the 30-MHz bandwidth of a 
channel, the fixed delay model can generate notches with a wide range 
of levels and notch widths. With the minimum out of band, it can 
generate a wide range of combinations of levels, slopes, and curvatures 
within the channel bandwidth. Section VI shows that the model 
versatility, with 7 chosen to be 6.31 ns, is sufficient to characterize a 
30-MHz channel in the 6-GHz common carrier band. 

Section II provides a brief discussion of the simple three-path fade. 
A comparative discussion of the relative merits of the different possible 
simplifications of this model leads to the choice of the fixed delay 
model. 

The data used for detailed evaluation of models were obtained from 
a 6-GHz experiment in Palmetto, Georgia, in June 1977. The radio 
channel was equipped with a general trade 78-Mbit/s, 8-PSK digital 
radio system, and the received spectrum was monitored with a set of 
24 filters with bandwidths of 200 kHz spaced at a 1.1-MHz separation 
across this channel. During fading activity, the received power of each 
of these frequencies was measured five times each second, or once 
every 2 seconds, depending on how rapidly the channel was changing; 
sampled power, quantized in 1-dB steps, was recorded by the MIDAS 
system.* The data base used for this study consists of approximately 
25,000 scans representing 8400 seconds of fading activity; about 8700 
scans were recorded during periods when the equipment was indicating 
errors. These data represent about 60 percent of the fading activity of 
a heavy fading month; therefore, the derived statistics must be viewed 
as provisional and subject to some modification as additional data are 
processed. At the very least, the data base is sufficiently large to 
indicate what can happen on the channel and to form a basis for 
choosing and validating a model. 

As described in Section III, the model parameters were estimated 
for each scan by fitting the magnitude squared of the transfer charac- 
teristic [eq. (1)] to the observed channel shape as characterized by the 
power received at the sampling frequencies. Phase is subsequently 
derived by assuming the channel is minimum phase. Problems are 
encountered in realizing a minimum-phase solution because of quan- 
tization noise and the presence of certain channel shapes caused by 
large delays. The procedure for handling these difficulties is described. 

The statistics of the parameters of the fixed delay model are dis- 
cussed in Section IV. Equations providing an idealized description of 
the statistics of the parameters of the model are also given here. 

In Section V, the determination of the delay difference present in 
the channel is considered. In the first subsection, it is demonstrated 


ae Input Data Acquisition System, constructed by G. A. Zimmerman; see 
Ref. 1. 
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that, during the observed period of fading activity, the average delay 
is 9 ns. A lower bound on the distribution of delay difference for large 
delays is developed in the second subsection. A third subsection 
provides an example of a channel scan that can best be approximated 
by a three-path fade with a delay difference of 26 ns. Fades with at 
least this delay and with a more moderate amount of shape (2 dB or 
more) were encountered for about 60 seconds of the data base studied. 
Thus, one might expect 26-ns delays to be present during about 100 
seconds of a heavy fading month. 

The presence of such large apparent delays raises questions as to 
the accuracy with which the fixed delay model represents the channel. 
These questions are addressed in Section VI where the statistics of the 
errors in modeling scan fits are described. The errors are small and do 
not compromise the usefulness of the model. 

Results and conclusions are briefly summarized in Section VII. 


Il. CHOICE OF MODEL 


In this section, we provide a brief description of the simple three- 
path model and show why it cannot be used to estimate delays when 
the delay bandwidth product is less than %. In a comparative discus- 
sion, we show why the fixed delay model is the only simplification of 

-the model that is manageable. 


2.1 Simple three-path model 


Consider a channel characterized by three paths or rays. The am- 
plitude of the signal on each of these three paths, as seen by the 
receiver, is 1, a;, and a2. The second and third paths are delayed with 
respect to the first by 7: and 72 seconds, respectively, where 72 > 71. We 
define the simple three-path model by requiring the delay between the 
first two paths to be sufficiently small, i.e., 


(we — w1)71 < 1, (2) 


where w2 and w are the highest and lowest (radian) frequencies in the 
band. The complex voltage transfer function of the channel at a 
frequency w may be illustrated with a phasor diagram. Figure 2a shows 
the phasor diagrams for w; and w2 superimposed. By designating the 
amplitude of the (vector) sum of the first two paths by a; the angle of 
the sum by ¢ = wot — 7, where 7 is equal to 72, the delay difference in 
the channel; and the amplitude of the third ray by ab, we obtain the 
simplified diagram in Fig 2b.* 


* Note that, if the third amplitude is greater than the sum of the first two, we 
interchange the assignments of amplitudes a and ab and obtain a nonminimum phase 
fade. 
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The simple three-path fade cannot be used for a channel model 
because the path parameters lack uniqueness. The basic difficulty is 
illustrated by the two superimposed fades in Fig. 3. Note that the 
amplitudes of the transfer functions of these two fades match, at 


H(w) = 1 +a,e7/@77 + ay e iWT2 






W171 =W2T, 


sia 


ab = ap 


(4 —W)T 





H(w) =a (1-be!(W- Ww) 7) 
(b) 


| Fig. 2—Simple three-path fade. (a) Three rays shown. (b) Simplified. 


Hy (wo) = Ho (wo) : 
119 2'w2 Hy (wo) =a, (1b, e7iT1) 


Ho (w) = ag (1-by eiWT2) 





Hy (w4) = Ho (w4) 


Fig. 3—Two degenerate simple three-path fades with wor = 0. 
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midband and at both edges. It has been shown elsewhere’ that fades 
matched in this way will be identical in band to within a few tenths of 
a decibel at most, and will have almost identical envelope delay 
distortion. Given noisy quantized measurements of | H(w)| over the 
band, it is impossible to distinguish between such fades unless we fix 
one of the four parameters. Let us consider each of the four possibili- 
ties. 


2.2 Pseudo two-path fade 


If one fixes the amplitude, a, at unity, the simple three-path fade 
reduces to a two-path fade with independent control of the frequency 
of the minimum of the response. The difficulty with this model, as 
may be seen by referring to Fig. 2b, is that it can provide in-band 
minima only for |H(w) | < 1 and maxima in-band only for | H(w) | > 1. 
In other words, the model cannot match an in-band maximum at an 
arbitrary fade level. In addition, it was found that during approximately 
half the periods when the radio equipment was indicating errors, the 
channel could not be well modeled with a pseudo two-path model. 


2.3 Scaled two-path fade 


If one fixes the phase, @ = wot — 7, in the simple three-path model 
at 0, the fade reduces to a scaled two-path fade. (For a two-path fade, 
we require the additional condition a = 1.) This is the most physically 
desirable of the reduced three-path models because it may be derived 
without recourse to the three-path formalism. Unfortunately, it is 
mathematically intractable, particularly when dealing with amplitude 
data only. In fitting the model to a given channel shape (in the manner 
described in Section III for the fixed delay model), one obtains a 
function of a, b, and 7 that must be minimized to obtain the best fit. 
Because of the wr term in the exponent of the model, this function has 
a local minimum in every interval of 7 of length 0.17 ns, the reciprocal 
of 6 GHz. Since the possible range of + extends to about 30 ns, one 
may have to perform hundreds of minimizations to find the best fit to 
a single channel scan. Even then this “best fit” may have no minimum 
phase realization, and there is no known procedure that leads to one. 


2.4 Fixed b model 


If one fixes the amplitude 6b in the simple three-path model, the 
resulting reduced model has all the mathematical difficulties of the 
scaled two-path model and no satisfactory physical interpretation. 


2.5 Fixed delay model 


It is demonstrated in the remainder of this paper that the fixed 
delay model described in Section I is useful and effective in character- 
izing the channel. 
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ii. ESTIMATION PROCEDURES 


This section describes how the model parameters are estimated from 
the channel scans and how realizability difficulties are surmounted. 


3.1 Parameter estimation 


The channel data consist of a set of 25,000 scans of the channel 
power spectrum. Each scan consists of a power measurement at each 
of 24 frequencies at 1.1-MHz spacing across the channel. (Actually, 
only 23 frequencies are used since the 19th was inoperative during this 
test period). The power measurements are recorded in decibels, and 
each must be referenced to the average power level of that frequency 
at mid-day. With proper conversion and calibration, the basic data 
characterizing a scan are a set of power ratios. We designate the power 
ratio at nth frequency by Y,, where 


Yn=Y(wn)  n=1,2)-++, 24. (3) 


We wish to model the channel with a voltage transfer function of 
the form given in eq. (1), which we repeat here for convenience 


H(w) = a{1 —- beer], (1) 
Thus our estimate of Y, will be 
Y, =| H(wn)|? = a — B Cos(wn — wo)r, (4) 
where 
= a*(1 + 6’) 
B = 2a°b. (5) 


For convenience, we measure frequency in the units of the frequency 
separation of the power measurements. Thus, 


Wn = 2afn = 2n(1.1 x 10°) n=1, 2,3, ---, 24. (6) 

If we choose 

1 
fe eee 1 
7 N(L1 X 109)’ (7) 
then 
n 

n = 2 vale ® 8 
WnT WT N ( ) 


For the fixed delay model, we choose N = 144 which gives a model + 
of 6.31 ns. Thus, the in-band frequencies correspond to n values 
between 1 and 24, and the channel transfer function given by the 
model is periodic for n modulo 144, corresponding to a frequency shift 
of 144 x 1.1 x 10° = 158.4 MHz. 
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The weighted mean-square error between the estimated and ob- 
served power is given by 


24 ; 
> CilYn — Y,.)° 
|: Or. (9) 


where the summation skips n = 19 as described above, and where C,, 
is a weighting applied to the measurement at frequency w,. Since the 
original data, from which the Y,,’s were derived, were uniformly quan- 
tized on a logarithmic scale, it is appropriate to use a weighting that is 
approximately logarithmic. Hence, we use the weighting function 


1 


Cn = 3 


(10) 
A number of different weighting functions were tested, but the one 
given by (10) is, generally, the most satisfactory. 

Estimates of a, b, and f, may be obtained by minimizing the weighted 
mean-square error, E. It is shown in the appendix that one may obtain 
closed form estimators for a, 8, and f, by substituting eq. (4) into (9) 
and minimizing £, first with respect to a, then with respect to B (or 
vice versa), and last with respect to f,.. In the resulting scheme, the 
estimator of f,, the frequency of the model minimum, is a function of 
data only. The estimators of a and £ are functions of the estimated f, 
and the data.* 

After estimates of a and £ have been calculated, the parameters a 
and b of the model are obtained by inverting the relationships given 


by eq. (5). 
2 1/2 
Q@ Q@ 
b=—-|(-| -1 11 
B (F) | 7 


7 B 1/2 


It is clear from (11) and (12) that we can realize the channel shape 
with the model only if a = £. This is to be expected. Since | H(w)|? is 
a power transfer function, it must be positive for all frequencies, which 
is possible only if a = B [see eq. (4)]. Thus, the condition a = B allows 
us to obtain a minimum (or nonminimum) phase transfer function 
whose magnitude squared is the minimum weighted mean-square error 
fit to the observed power transfer response of the channel. 


* For mathematical simplicity, we actually use an estimator for 8 conditioned on f,, 
a, and the data. 
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3.2 Application of estimators 


If the procedure described above is strictly applied to the set of 
25,000 scans in the data base, one finds that about 35 percent of the 
scans cannot be modeled with real values of a and b. A study of these 
problem scans revealed that the estimator for f,, the frequency of the 
fade minimum, was biased for two types of scans. One type is a scan 
with little shape, dominated by quantization noise; the other is a 
selective channel shape having a steep slope across the band. Both 
types of scan are illustrated in Fig. 4. The scan in Fig. 4, which is 
almost flat, was fabricated to illustrate the severity of the quantization 
problem. The other scan is typical of the more shapely troublesome 
scans. 

To obtain a good realizable fit to such channel shapes requires 
degrading the quality of the fit; that is, moving the parameters away 
from the values that minimize the fit error, eq. (9). Given the form of 
the estimation scheme, this is easily accomplished by moving the 
frequency of the fade minimum, f,, away from its original “optimum” 
value and reoptimizing the remaining parameters to obtain values of 
a and b that are optimum for the new value of f,. Figures 5 and 6 
illustrate the results of such a quasi-optimization regarding f, as a free 
parameter. They show the fit error EF and the values of a and 0b as f, is 
varied from its original optimum value. Figure 5 corresponds to the 
flat fade in Fig. 4 and Fig. 6 to the sloped fade. 

The shapes of the curves in Figs. 5 and 6 are typical of those 
obtained when the channel has no minimum in band. The weighted 
error in the fit, E, is not very sensitive to the estimate of f,, the 
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Fig. 4—Two channel scans that produce realization difficulties. 
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Fig. 5—Locus of weighted fit error and model parameters with f, as a free variable for 
flat fade in Fig. 4. 


frequency of the modeled fade minimum. The minimum of EF is broad 
and flat, due to quantization and instrumentation noise in the channel. 
The variation of the parameter a with f, is also typically very gradual. 
The salient features of the variation of b with respect to f, are clearly 
seen in Fig. 6, and are also present and labeled in Fig. 5. As f, is varied 
from its original optimum value, 6 varies from a value of 1 to a value 
of 0 in a sideways s-curve with two stationary points, a maximum and 
a minimum. Extensive simulations with known channel characteristics 
indicate that a good choice of parameters is the set corresponding to 
the point where b is locally minimized. To illustrate this point, assume 
that the channel shape is that given by the model, with 6.3-ns delay, 
fo at 18.5 X 1.1 MHz, a = 0.04, and b = 0.7. One can construct a plot 
similar to Figs. 5 and 6 for this simulated fade, with the result shown 
in Fig. 7. The curves in this figure illustrate the results cited above, in 
that the true value of f, occurs near a minimum value of b. A better 
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Fig. 6—Locus of weighted fit error and model parameters with f, as a free variable for 
typical scan in Fig. 4. 


choice for the case shown and for others that have been simulated 
would be “on the shoulder” between the minimum and 6 = 1; however, 
such a criterion is difficult to quantify. 

To summarize, if the standard routine does not provide a realizable 
fit to a scan, one merely varies f,, the position of the minimum, until 
one obtains a realizable solution with a value of } that is stationary* 
with respect to variations in f,. We recognize that this procedure 
introduces additional sources of error into the estimates of the model 
parameters. The errors in a and 0b are small because 0 is near a 
stationary value and a is slowly varying. The error in f, is also small, 


* Since b is a monotone function of a/, it is only necessary to invert solutions with 
stationary values of the ratio, a/B. 
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usually less than 3 MHz, but is always in the direction corresponding 
to moving the minimum nearest to the band closer. We consider the 
effects of these errors in Section VI. 


IV. MODEL STATISTICS 


Applying the procedures described in Section III to the scans in the 
data base results in 25,000 sets of values of a, b, and f,. The relative 
joint frequency of occurrence of these three parameters may be de- 
scribed by the set of distribution functions shown in Figs. 8 to 12. The 
distribution of the parameter b is described in Fig. 8 in terms of the 
distribution of —20 log (1 — b), which is approximately exponentially 
distributed with a mean of 3.8 dB. This distribution gives the time 
that b exceeds the value given by the abscissa as a fraction of the time 
in a heavy fading month that the rms level in the channel is depressed 
by more than 15 dB. For instance, we see that 40 percent of the time 
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Fig. 7—Locus of weighted fit error and model parameters with f, as a free variable. 
For channel given by model with 7 = 6.31 ns, a = 0.04, b = 0.7, f, = 18.5 X 1.1 MHz. 
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Fig. 8—Distribution of 0. 


when the channel is depressed the value of b exceeds 0.3. It exceeds 
0.7 for 4 percent of that time, and 0.99 about 0.3 percent of that time. 
The distribution of 6 can be modeled in the form 


20 


P(l-b<X) = X°oe? = X23, (13) 


The distribution of a is conditioned on 6b and is approximately 
lognormal as shown in Figs. 9 and 10. The mean and standard deviation 
of the distributions in Figs. 9 and 10 are plotted in Fig. 11. From Figs. 
9 to 11 it is apparent that a and b are almost independent; however, 
less shapely fades tend to occur at less depressed values. We note that 
shape occurs when the average depression is 20 to 25 dB,* that the 


* The value of a corresponds to average power level over a large frequency span and 
not strictly to the average power in a narrowband channel. 
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Fig. 9—Distribution of a conditioned on the value of b for —20 log (1 — 6) less than 
8.5 dB. 


average depression is near 25 dB for 6 greater than 0.7, and that it falls 
off gradually to 15 dB for small b. The distribution of A = —20 log a is 
conditioned on 6 and may be modeled as 


a 


P(A> ¥)=1-P| 5 (14) 


where P is the cumulative distribution function of a zero mean, unit 
variance, and Gaussian random variable, and A,(b) is the mean of A 
for a given value of b as given in Fig. 11. We see from Fig. 11 that the 
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Fig. 10—Distribution of a conditioned on the value of 5 for —20 log (1 — 6) greater 
than 8.5 dB. 


standard deviation of A may be taken as 5 dB regardless of the value 
of b; the variations near —20 log (1 — b) = 20 are due to small sample 
problems. 

Figure 12 shows the time during which scans had f, in 4 X 1.1-MHz 
frequency intervals. It is, in effect, an estimate of the density function 
of the distribution of f, and is, consequently, quite noisy. The maxima 
near + 30 X 1.1 MHz from the center of the band are due in part to the 
movement of estimates of f, to achieve realizability. While, on physical 
grounds, one would expect f, to have a uniform distribution, the fixed 
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delay model is decidedly not a physical model. Consider a simulated 
set of simple three-path fades having a uniform distribution of f,, fixed 
values for a and 8, and a delay 1, fixed at a value other than 6.31 ns. 
This set of fades will engender a nonflat probability density function 
for the f,’s obtained in fitting to the 6.31-ns model. The probability 
density function is flat within the band regardless of the fixed delay of 
the set of simulated fades; however, it will more nearly resemble that 
shown in Fig. 12 if the delay of the set is greater than 6.31 ns than if it 
is less than 6.31 ns. In short, Fig. 12 is characteristic of a channel with 
a considerable fraction of delay differences greater than 6 ns. 

Based on Fig 12, we approximate the probability density function of 
fo by a two-level function. Note that f, is defined on an interval of 
length 1/7, where 7 is 6.3 ns the delay of the model. Thus, with f, 
measured from the center of the band, the probability density function 
for f, may be approximated by 


5r ] 
m7 l#l= | 
Pr (fo) = (15) 
5 e<lhl<= 
3. Ar BM De 


MEAN, Ag 


DECIBELS 


STANDARD DEVIATION 





0 4 8 12 16 20 24 
—20 Log (1—b) 


Fig. 11—Mean and standard deviation of the distribution of —20 log a as a function 
of —20 log (1 — 5). 
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Fig. 12—Time that model parameter, f,, was in intervals of width 4 x 1.1 MHz. 


An extensive examination of various conditional distributions has 
established that there are no other obvious and pervasive dependencies 
among the statistics of the parameters. 


V. CHANNEL DELAY DIFFERENCE 


This section presents some results obtained in estimating the chan- 
nel delay difference. Some techniques described here are used in the 
error analysis in Section VI. Three topics are considered in this section. 
First a simple method is presented of estimating the average delay 
spread in the channel. A second subsection shows that the distribution 
of large delays (larger than 10 ns) can be obtained for a simple three- 
path fade model. The delay distribution is shown to be consistent with 
the estimate of average delay. A third subsection illustrates the prob- 
lem with an observed channel shape that can be matched most 
successfully using a simple three-path model with a delay of approxi- 
mately 26 ns. 


5.1 Mean delay difference in the channel 


The mean delay difference of a channel that can be characterized by 
a simple three-path model is easily estimated. Consider a fade with a 
delay, r. If f,, the frequency of the minimum, is uniformly distributed, 
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the probability that such a fade produces a minimum in a band B Hz 
wide is equal to the ratio of the bandwidth to the spacing of the 
minima, or 


ale = Br. (16) 


If p(t,)Ar is the fractional number of fades having delays between (k 
— 1)Ar and kAz, then the fractional number of fades having a minimum 
in band will be Pmin, where 


Pras = » Bripp (tz) At = Br (17) 
k 
and 
T=Y trp (te) At = | tp(r) dr. (18) 


It follows from eq. (17) that one may estimate the mean delay, 7, 
from a knowledge of Pmin, the fractional number of scans having a 
minimum in a band of width B. Since any method of determining Pyin 
is acceptable, consider estimates of Pmin from the parameters estimated 
using the fixed delay model. The method of estimating the frequency 
parameter in the model involved moving null positions of some fades 
that had out-of-band minima. These fades can be excluded by using 
only the central two-thirds of the band in estimating 7. Of the 24,920 
scans in the data base, 3974 had minima between the 4th and 20th 
frequencies. Hence, 


__ | 3974 1 a an 
7194990 |/16x11x10° - 


One might argue that the mean delay should be estimated for a 
more carefully screened set of scans. Table I shows the mean delay 
estimates obtained from scan populations qualified by having the 
estimate of the model parameter a in a given 5-dB interval. Table II 


Table |—Mean delay for scans selected by value of 
parameter, a 


—20 Log a, dB Number of Scans Scans me aoe in Delay, 7, ns 





ani 


0-5 101 31 17.4 

5-10 725 235 18.4 
10-15 4299 875 11.6 
15-20 6891 1161 9.6 
20-25 7644 906 6.7 
25-30 4184 606 8.2 
30-35 1019 159 8.9 
All 24920 3974 9.1 
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Table Il—Mean delay for scans selected by value of 
parameter, b 
Scans With Min. in Delay, 


—20 Log 1-b, dB Number of Scans Band T, ns 
0-2 10,442 1186 6.5 
2-4 7040 1712 13.8 
4-6 3721 538 8.2 
6-8 1474 191 7.4 
8-10 892 118 7.5 

10-12 527 68 7.3 
12-14 282 28 5.6 
14-16 190 21 6.3 
16-18 146 46 17.9 
18-20 99 32 18.4 
All 24920 3974 9.1 


shows mean delay estimates qualified by the model parameter b, which 
specifies the shapeliness of the fade. 

With several exceptions, the estimated delay spreads given in Tables 
I and II are reasonably constant. One exception is seen for large values 
of b (—20 log 1 — 6 greater than 16). This is consistent with a channel 
for which large differential attenuation across the channel is more 
likely to occur when long delays are present. The existence of such a 
correlation should not be surprising. The other exception is the large 
delays estimated for small values of 6 and for values of a between 0 
and 10 dB. We provide strong evidence of the existence of such a class 
of fades in the next subsection. The existence of this subclass of fades 
suggests that they have a different physical source than the other 
fades in the population. 


5.2 Distribution of delay difference 


To further enhance our knowledge of the distribution of delay in the 
channel, the data base was processed to extract a delay estimate. 
Recall that, for the fixed delay model, parameter estimates are chosen 
to minimize the weighted fit error [E in eq. (9)] for a given fixed r. 
The present calculation was performed for a set of different values of 
7 and the value which produced the smallest weighted fit error and 
corresponded to a realizable fade was designated as the delay for that 
scan if it met certain qualifications. 

Because of the degeneracy in the simple three-path model, changing 
the delay in the fixed delay model will not appreciably improve the fit 
for any scan that can be well approximated by a fixed delay of 6 ns or 
less.’ In performing the optimum delay calculation, the weighted fit 
error was minimized for a predetermined set of delays; the differences 
between adjacent delay values were chosen to be approximately 15 
percent. A given scan was assigned a delay different from 6.3 ns only 
if the third best value of the weighted fit error was at least 0.1 dB 
worse than the best value. (We use the third best value because we 
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must examine three values to detect a minimum.) This criterion sets 
a threshold on the acceptable sharpness of the minimum in the fit 
error with respect to changes in delay. 

The selection criterion was chosen, after several iterations, to insure 
regularity in the estimates derived from successive scans. With the 
chosen criterion, the scans that were assigned a new delay occurred in 
groups of consecutive scans and may be said to constitute fading 
events. During any of these events, the delay was consistent in that 
indicated delays were within +15 percent. If we assume that the 
physical channel does not change between scans, we can associate a 
time with each scan and plot the distribution of the time periods 
during which the characterizing delay was greater than a specified 
delay. 

A series of such plots, conditioned on the concurrently estimated 
value of 6, is given in Fig. 13. The uppermost curve contains the data 
derived from all scans which met the selection criterion; its shape is 
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DELAY IN NANOSECONDS 


Fig. 13—Distribution of optimum delay for simple three-path model, as qualified 


by realizability, the sharpness of minimum, and by several values of the model 
parameter b. 
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dominated by the 627 seconds during which the channel was best 
modeled by a delay of 43 ns (the largest delay in the test set) or more, 
but had little shape (6 < 0.115). These characteristics contribute to 
the large (18 ns) mean delays noted in the previous subsection for 
small values of b. They may be due to quantization but are apparently 
not artifacts of the estimation scheme. Although the origin of this type 
of channel defect is currently not understood, it should not trouble 
any existing radio system. 

It is apparent from the distributions in Fig. 13 that very few scans 
qualified for a new delay with delays less than 10 ns. Consequently, 
the distribution should not be trusted for delays less than 12 or 15 ns; 
beyond 15 ns, it may be interpreted as a lower bound to the true 
distribution. The three curves qualified by the parameter b correspond 
to fades with peak-to-peak variability of 2, 3, and 6 dB. (Peak-to-peak 
variability is 20 log (1 + b/1 — 6), as may be seen in Fig. 1.) If the delay 
were exponentially distributed, the distribution of delay would be a 
straight line on Fig. 13 and would have the form 


P(t>x) =e". (20) 


Fitting a straight line to the three distributions in Fig. 13 for which b 
> 0.115 shows that the average delay decreases with increasing b. The 
corresponding values are 5, 5.5, and 11 ns. Note that this implies that 
b and 7 in a simple three-path model are not independent. 


5.3 An example of along delay scan 


To confirm the existence of long delay scans, consider an event that 
covered approximately 10 seconds on 22 June 1977, from 23 h, 28 m, 54 
s. A representative scan from the middle of this period is shown with 
the fit obtained with the fixed delay (6.3 ns) model in Fig. 14a. To 
emphasize the consistency of this channel condition, an average of the 
channel condition for the central 4.2 seconds (21 scans) of this event 
is compared to the selected scan in Fig. 14b. 

It is apparent from Fig. 14a that the 6.3 ns delay does not have 
enough curvature (delay is too short) to precisely model the channel 
shape. Figure 15 shows the same scan modeled by three-path fades 
having delays of 22.7, 26, and 30.3 ns. The 26-ns fit is the best; it has 
a weighted fit error 0.4 dB better than the 22.7-ns fit and 0.8 dB better 
than the 30.3 ns fit. However, the closeness of all three fits illustrates 
the difficulties in estimating channel delay differences. Visually, one 
would choose the 26-ns model on the basis that the 30.3-ns fit has too 
much curvature and the 22.7-ns fit too little. 


VI. ERROR ANALYSIS 


To verify that the model adequately represents the transmission 
characteristics of the channel, we examine the errors between the 
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Fig. 14—Scan from 22 June 1977, 23 h, 28 m, 48.6 s. (a) Comparison with fixed delay 
model. (b) Comparison with average of scans from 23 h, 28 m, 46.4 s to 23 h, 28 m, 
50.4 s. 


channel as observed and as modeled. In this section we consider the 
statistics of the rms errors and the maximum errors. 


6.1 RMS errors 


A useful measure of the quality of the fit of the model to a given 
channel scan is the root-mean-square value of the decibel error at each 
of the sampled frequencies. Denoting this error as E,ns, we have 


1 1/2 
Enns = E yY. (dB error at fa ; (21) 
n=1 


n#19 


The model parameters were estimated, as described in Section III, to 
minimize the error, E, which is a weighted sum of the squares of the 
power differences at each frequency [see eq. (9)]. The weighting was 
chosen [eq. (10)] so that the error E would approximate the error Ems 
as given by eq. (21).* Indeed, one may show directly that the two 
expressions are equivalent as long as 


[1-2 1« 1 for all n. (22) 


* Note that the parameter estimation problem cannot be solved in closed form by 
minimizing E,ns. 
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Fig. 15—Model fits to long delay scan for three different model delays. 


As we have seen in Fig. 14, this inequality is not always satisfied. 
Consequently, in using E,ns aS a standard of comparison, we are 
evaluating not only how well the model fits the observed channel, but 
also how well we have chosen the parameters to make the match. 

The error E,ms is a desirable quantity to work with because we can 
estimate its distribution under the assumption of perfect matching. 
We observe that if the decibel error were Gaussian with unit variance 
and zero mean, 23 E2,,; would be a x” variable with 20 degrees of 
freedom (to account for the three parameters estimated per scan). 
Observations of a simulated channel with the transmitter and receiver 
back-to-back indicate that the instrumentation errors are approxi- 
mately Gaussian with a standard deviation, o;, of about 0.65 dB. 
Observations of the channel at mid-day with the channel nominally 
flat and unfaded indicate that the standard deviation of the errors is 
between 0.68 and 0.73, varying frequency to frequency and day to day 
by a few hundredths of a decibel. Hence, if we enter a table of the x? 
distribution, Q(x? | 20), with 


4 BE, 
oF 


, (23) 
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we can determine the distribution of E,,.; under the assumption of 
perfect matching.* This distribution is shown as a reference on 
Figs. 16 and 17. It is indicated by a solid curve labeled “ideal” for o; 
= 0.70 and by o’s for a; = 0.75. 

Figure 16 presents the distribution of the rms error for two scan 
subpopulations using the fixed delay (6.3 ns) model. The subpopulation 
of the distribution labeled “standard” consists of all scans that could 
be modeled directly; the distribution labeled “modified” shows the rms 
error distribution for all scans which required an adjustment of the 
frequency of the modeled fade to achieve realizability. Figure 17 shows 
the distribution of the rms error for the composite of all samples using 
the fixed delay (6.3 ns) model. The distribution labeled simple three- 
path model indicates the error distribution that was obtained when 
the scan fitting allowed unqualified variation in model delay to achieve 
the best fit. That is, the calculation described in Section 5.2 was 
performed and the results were qualified only on the basis of realiza- 
bility.+ 

In each case described above, the mean value of the rms error is 
close to the median value. For the two subpopulations shown in Fig. 
16, the calculated mean fit errors correspond to o; values of 0.76 and 
0.85 dB, or the errors are about 0.09 dB larger when a realizable fit is 
obtained by varying the frequency of the model minimum. Comparing 
the composite distributions in Fig. 17, we find that the mean error in 
the fixed delay (6.3 ns) model corresponds to o; = 0.78 dB or about 
0.08 dB higher than that observed when the channel is quiescent. The 
simple three-path model has a distribution of rms error that very 
nearly matches the ideal distribution (with 19 degrees of freedom) for 
oi = 0.75. This is consistent with the instrumentation error imputed to 
the standard distribution in Fig. 16 and is indicative of the instrumen- 
tation error in the presence of multipath fading. It is exceptionally 
good considering that the data are obtained from time sequential 
measurements on a dynamically changing channel. One concludes that 
the modeling error is negligible for the simple three-path model. For 
the fixed delay model under the assumption that the instrumentation 
and modeling errors add in quadrature, the modeling error has a 
tolerable value on the order of 0.2 dB. That is, 


[(0.75)? + (0.2)2]'/? = 0.776. 


The tails of the distributions in Figs. 16 and 17 for large errors are 
of considerable interest. The tails near small values are of little 


* From the central limit theorem, we know that E7,,s will be approximately Gaussian, 
as is x”, regardless of whether or not the measurement errors are precisely Gaussian. 

+ Note that although one cannot always reliably localize the values of the parameters 
in fitting with the simple three-path model (see discussions in Section 2.1 and Ref. 7), 
the error in the fit is always well defined. 
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Fig. 16—Distribution of rms fit error for two scan subpopulations with fixed delay 
(6.3 ns) model. 


consequence; they are distorted by quantization because one cannot 
associate any error with the 12 flat fades included in the data base. 
The deviation of the distributions from the ideal distribution at large 
errors is significant. 

The large deviation of the modified fits in Fig. 16 reflects the failure 
of the fixed delay (6.3 ns) model to accurately fit the long delay fades. 
The tail deviation from ideal is modest down to about the 0.5 percent 
level, corresponding to a few tens of seconds per month. For compar- 
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Fig. 17—Distribution of rms fit error for composite population with fixed delay and 
simple three-path models. 


ison, we note that the rms error of the fit shown in Fig. 14a is 2.3 dB; 
this was the worst fit encountered for the fixed delay (6.3 ns) model. 
However, even in this case the model failure is hardly describable as 
severe. The model of the channel is depressed by 40 dB and has 9.5 dB 
of gain slope; the actual channel is depressed by 39 dB and has 11 dB 
of gain slope. Also, we note that the 6.3-ns delay model has the 
response minimum at about the same frequency as the best represen- 
tation, the 26-ns delay model shown in Fig. 15. 
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The deviation of the tail of the error distribution for the three-path 
fade (Fig. 17) reflects the fact that there are fades that even this model 
has difficulty in fitting. An example of such a fade is shown in Fig. 18 
along with the fit provided by the fixed delay (6.3 ns) model. The same 
rms error (1.6 dB) is obtained for all values of model delay between 
0.05 and 9 ns; the fit degrades for larger delays. Either more than three 
rays are needed to describe the channel shape in Fig. 18, or the channel 
is so depressed that the amplitudes in the notch are distorted due to 
closeness to the noise level in the measuring equipment. The scan 
shown in Fig. 18 is one of three similar scans and has little statistical 
significance. 


6.2 Maximum errors 


Another type of error that can be used to judge the quality of the fit 
of the model to the channel is the worst-case error. That is, after fitting 
to each scan, one records the magnitude of the largest difference (in 
decibels) between the observed channel shape and the shape calculated 
from the model. The following paragraphs consider the distribution of 
these worst-case errors. 

As in the preceding subsection, we can calculate an ideal distribution; 
however, the ideal distribution is not as realistic in this case since it is 
strongly dependent on the tails of the distributions of the individual 
measurement errors. We assume that each power measurement had 
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Fig. 18--Severe fade observed on 22 June 1977 at 22 h, 29 m, 8.6 s. 
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an error in decibels that was Gaussian, with zero mean, a standard 
deviation of V20/23 o; to account for the three parameters estimated 
from the 23 observations per scan,* and that the errors are independent 
frequency to frequency. If the probability of any one measurement 
having an error less than x is denoted by P;(x), the probability that all 
23 have values less than x is 


P3(x) = [Pi(x)]”. _ (24) 


This is the probability that the maximum error is less than x, whereas 
we want the probability that it is greater than x which we denote 
Qo3(x). It follows immediately from eq. (24) that 


Qo3(x) =l]- [Pi(x)]}? 
=1-[1- Q,(x)]”. (25) 


The distribution given by (25) is used as a reference in Figs. 19 and 
20, which show the distribution of the maximum error for the same 
cases as in Figs. 16 and 17. Since the tails of these distributions are 
well behaved for larger errors, the distribution of the maximum errors 
is apparently dominated by the instrumentation noise. That is, if we 
use for the standard deviation of the measurement noise the value 
obtained from the mean of E,ms; for one of these cases (as given in 
Section 6.1), the resulting worst-case error distribution calculated with 
eq. (25) will closely match the observed maximum error distribution. 


Vil. CONCLUSIONS 


By analyzing the errors in fitting the observed channel characteris- 
tics in Section VI, we demonstrated that the simple three-path fade 
model is indistinguishable from a perfect model of a line-of-sight 
microwave radio channel. 

The simple three-path model was used in Section V to characterize 
the channel delay difference. By two different methods, it was shown 
that, when there is 3 dB or more shape present in the channel, the 
average delay difference is between 5 and 8 ns. We developed a lower 
bound on the tails of the distribution of delay difference. From these 
results, which are shown in Fig. 13, we observe that a differential 
channel attenuation in-band of 3 dB or more may be due to delay 
differences as great as 43 ns. In another dimension, one would expect 
to see differential attenuation of 3 dB or more in-band due to delays 
greater than 20 ns for at least 70 seconds in a heavy fading month. 
This is comparable to the time the channel attenuation at a single 
frequency exceeds 40 dB. 


* For comparisons with the three-path model, it is appropriate to use V19/236;. 
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Fig. 19—Distribution of maximum (dB) fit error for two scan subpopulations with 
fixed delay (6.3 ns) model. 


From the error analysis in Section VI, we also conclude that the 
fixed delay (6.3 ns) model is a very good approximation to the channel 
for all observed conditions. This conclusion is further substantiated by 
Figs. 14 and 18, which show the scans for which the fits with the fixed 
delay model exhibited the largest rms fit error (2.3 dB) and the largest 
maximum error (3.9 dB), respectively. The fixed delay model is pref- 
erable to the three-path model for channel modeling because it requires 
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Fig. 20—Distribution of maximum (dB) fit error for composite population with fixed 
delay and simple three-path models. 


only three parameters, and these can always be uniquely determined 
from a channel amplitude scan. 

The statistics of the parameters of the fixed delay model as described 
in Section IV and shown in Figs. 8 to 12 provide the means of 
statistically generating all the channel conditions that one expects to 
see on a nominal hop channel operated at 6 GHz. If one determines, 
by laboratory test, the parameter values that will cause a particular 
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error rate in a digital radio system, one can easily calculate the time 
during a heavy fading month that the error rate will equal or exceed 
this critical value. A companion paper describes the laboratory test 
and the required calculations.® 

Future work will be directed toward verifying the model and model 
statistics with additional fading data obtained both at 6 GHz and at 4 
GHz. Using coherent data obtained in 1973, it will be possible to 
determine the extent to which the channel is actually a minimum 
phase channel. 
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APPENDIX 
Estimation of Parameters 


The problem of estimating the parameters a, $8, and f, in Section III 
is equivalent to the problem of determining the first three terms in a 
subharmonic Fourier series expansion of a function in the frequency 
domain. Since such expansions are not standard, we provide a complete 
description of the methodology here. 

From eqs. (4) and (9), we may express the weighted mean-square 
error between estimated and observed power as* 


7 ¥ Cal Yn — a + B cos(wn — wo)T)? 





E 5G, (26) 
For simplicity, we use a normalized weighting function, d,, defined by 
dy = a (27) 
so that 
¥ dn = 1. (28) 
In terms of the normalized weighting we may write (26) as 
E=Y d,(Y,n — a + B cos(wn — w)7)? (29) 


* Throughout this appendix, all summations are taken over all values of n correspond- 
ing to all frequencies observed in a scan. 
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or in expanded form as 
E =Y dnYi + a’ + BY dicos*(wn — wo)T 
+ 2B Y¥ dnYncos(wn — wo)T 
— 2aB ¥; d,cos(wn — wo)t — 2a ¥ dnYn. (30) 
The error E is a minimum when a, £, and wo are chosen so that the 
partial derivatives of E with respect to a, B, and wo are all equal to 


zero. Setting the partial derivative of eq. (30) with respect to 8 equal 
to zero and solving for £ gives 


_ a Y; dncos(wn — wo)T — Y) dn Ync0S(Wn — wo)T 
¥ dacos”(wn — wo)T 


B (31) 


Substituting (31) into eq. (30), we find Ez, the error minimized with 
respect to B, as 


1 
¥ dnc0s*(Wn — wo) T 


-{a°[¥ dncos’ (wn — wo)t — (Y, dncos(Wn — wo)T)"] 

— 2a [(¥, dr Yn)(¥, dncos? (wn — wo)T) 

— (¥; d,cos(@n — wo)T)(¥ dn Yncos(wWn — wo)T)] 

— (¥ dnYncos(wn — wo)T)”}. (32) 


Minimizing this with respect to a requires that we set the partial 
derivative of HE, with respect to a equal to zero. This gives 
(> dn Yn) (>) d,Cos’ (wp, Te wWo)T) a 


= (> dnc0S(Wn — wo)T) (>; An YnCOS(Wn — wo)T) 


TS ahcos (an al 1 Goo SG (33) 


Substituting (33) into (32) gives E.,, the error minimized with respect 
to both «a and £, as 
Ea = dnYn — Y? 
_ (Y da(Yn — Y)cos(wn — wo)r)? (34) 
¥ dncos?(wn — wo)t — (Y dncOS(wn — wo)T)”’ 


where 
¥=¥ daVn. (35) 


We note that we could have obtained this same expression by first 
minimizing with respect to a and then with respect to 8; however, one 
obtains different but equivalent expressions for a and £, depending on 
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the order of differentiation. We develop the alternative expressions for 
a and £ in the following paragraphs. 

Let us define some new quantities to simplify these expressions. Let 
the difference between the observed power and the weighted mean 
power in the band be designated by X,,; then 


Kn = Ya din = Yury. (36) 
If we also define the quantities 
Xe = ¥ dnaXnCOS Wn, (37) 
Xs = ¥} d,XnSin wnt, (38) 
Da = Y, dncos?(wn — wo)T, (39) 
Ds = Y, dncos(wn — wo)T, (40) 


we may rewrite a and £ from eqs. (31) and (33) as 


— [X,.cos wot + X;Sin wot ]De 


a= Y D.= D? (41) 
and 
1 = 
B= D. {(a — Y)Ds — (X.cos wot + Xssin woT)}. (42) 
Using (41) to eliminate a from (42), we obtain 
X,-COS WoT + X,SiN WoT 
es os dl i 
We may use (43) in (41) to obtain 
a=Yt BDo. (44) 


Equations (43) and (44) are the estimators that would have been 
obtained if the order of taking partial derivatives in the preceding 
development had been reversed. It is apparent that, after one has 
estimated wo, one may estimate a and £ by using either eqs. (41) and 
(42), (48) and (44), or eqs. (41) and (48). 

The estimate of wo that minimizes the weighted error is obtained by 
minimizing Ez, with respect to wo. Using eqs. (35) to (40) in eq. (34), 
we write : 


[X-cos wor + X,sin wor |” 
Daz _ D} 


To see the explicit dependence of Eg, on wo, we define the following 
quantities 


de = Y. dncos’wnt — (¥ dncos wnt)’, (46) 


Epa = Yi dnX? — (45) 
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d, = ¥\ dasin’wrt — (Y dasin wn)”, (47) 
des = Y, dnCOS WaT SIN WaT — (Y dncos wnt)(Y, daSin wnt). (48) 
Substituting these into (45) gives 


[X-cos Wat + X.sin wot] 


pa = 2 UnXn d-cos” wot + 2d-sCOS WoT SIN WoT + d,Sin?worT 


(49) 
Setting the partial derivative of Eg,, as given by (49), equal to zero 
gives the estimator for wo as 


d.Xs = dis Xe 
WoT = Tan"| |: (50) 


d,X- a des Xs 


Obviously, two values of wor in the interval (—z, 7] will satisfy 
eq. (50). One of these, the principal value, lies in the interval (—7/2, 
a/2|, the other differs from the first by +7. We shall show that the two 
solutions are equivalent, but that our chosen solution is unique. 

If we replace wot by wot + 7 in eqs. (39), (40), (43), (44), and (45), we 
see that D, and £ change sign and a and E,, are unchanged. Since we 
want the solution with £ greater than zero, we take the principal value 
solution to (50) if the resulting estimate of £ is positive. Otherwise we 
add or subtract 7 to obtain a positive value for 8 and a value of wor in 
the appropriate interval. 

While we could substitute the result of eq. (50) into (49) to obtain 
the minimum error, E,2.,, it is more generally useful to evaluate Eug 
for the optimum wo. This is especially true when we do not use the 
optimum w, as given by eq. (50). The simplest form for F., is obtained 
by substituting (43) into (45) to give 


Eup = ¥ dnXn — (Da — Dé) 6’. (51) 


These equations were implemented, with the modifications de- 
scribed in Section 3.2, to obtain all the fits described in this paper. 
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A statistical model (introduced in a companion paper) of fading on 
a radio path ts used with laboratory measurements on a digital radio 
system to estimate the outage due to multipath fading, where outage 
is the time that the bit error rate (BER) exceeds a threshold. Over the 
range of BER of interest (10~° to 107°), the calculated outage agrees 
favorably with the outage observed during the period for which the 
fading model was developed. It is further shown that the calculated 
outage, when scaled to a heavy fading month on the basis of single- 
frequency, time-faded statistics, agrees equally well with the outage 
observed on the same path during a heavy fading month. The agree- 
ment between measured and predicted outage substantiates the se- 
lective fading model. The prescribed laboratory measurements char- 
acterize the sensitivity of the radio system to selective fading. Thus, 
the methodology provides a useful basis for comparing the outage of 
alternative realizations of digital radio systems. 


I. INTRODUCTION 


Present interest in using high-speed common carrier digital radio” 
has precipitated a need for estimating the performance of such systems 
during periods of selective (multipath) fading. This paper describes a 
method of characterizing a digital radio system in the laboratory which 
allows the outage to be accurately estimated. For a digital radio 
system, outage requirements are stated in terms of the number of 
seconds in a time period (usually a heavy fading month) during which 
the bit error rate (BER) may exceed a specified level; typically, 107° or 
10~‘ is appropriate to voice circuit application. 

The method is based upon a statistical channel model® developed 
from measurements on an unprotected 26.4-mile hop in the 6-GHz 
band in Palmetto, Georgia in 1977 using a general trade 8-Psk digital 
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radio system as a channel measuring probe. The modeled fading 
occurrences were scaled to the basis of a heavy fading month using the 
occurrence of time faded below a level at a single frequency as the 
means of calibration. The bit error rate performance of the digital 
radio system was measured during the time period used for channel 
modeling and for an extended period corresponding to a heavy fading 
month. This same radio system was later subjected to a measurement 
program in the laboratory using a multipath simulator which provides 
a circuit realization of the fading model. The measured results are used 
with the channel model to determine the occurrence of channel con- 
ditions which will cause the BER to exceed a given threshold. Compar- 
isons on the basis of the modeling period and a heavy fading month 
show good agreement between calculated and observed outages for 
BERs between 10~° and 107°. 

The properties of the fixed-delay channel model are reviewed briefly 
in Section II as a basis for describing the measurements and for the 
subsequent outage calculations. This three-parameter channel model 
is realized in the laboratory by an 1F fade simulator. The simulator 
and its use in obtaining the necessary laboratory data are described in 
Section III. The procedures to be followed in calculating outage times 
for a given BER are described in Section IV. Calculated and observed 
outage times are compared in Section V. Conclusions are provided in 
Section VI. 


ll. MODEL DESCRIPTION—-METHODOLOGY 


It has been demonstrated® that the complex voltage transfer function 
of a line-of-sight microwave radio channel is well modeled by the 
function 


H(w) = a [1 — be Yer] (1) 


with 7 fixed. A 6-GHz channel (30-MHz bandwidth) has been charac- 
terized statistically by the model with t = 6.3 ns. Such a channel has 
a power transfer function given by 

| H(w) |? = a7[1 + b? — 26 cos(w — wo)r] (2) 


and an envelope delay distortion function, i.e., the derivative of the 
phase of H(w) with respect to w, given by 


br(cos(w — wo)t — 5) 


Teves 1+ 6? — 2b cos(w — wo)T 


(3) 


In the following paragraphs, we summarize the properties of the model, 
the statistics of the model parameters, and the measurement objec- 
tives. 
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2.1 Fixed delay model 


A plot of the attenuation produced by the fixed delay model of eq. 
(1) is shown in Fig. 1. Since the delay is fixed at 6.3 ns, the spacing 
between nulls of the response, 158.4 MHz, is much larger than the 
channel bandwidth. The parameters a and 6 control the depth and 
shape of the simulated fade, respectively. The parameter /,(=w0/27) 
determines the position of the fade minimum or notch. Both the notch 
frequency, f., and the response frequency, f, are measured from the 
center of the 30-MHz channel for convenience. 

The model function of eq. (1) may be interpreted as the response of 
a channel which provides a direct transmission path with amplitude a 
and a second path providing a relative amplitude b at a delay of 6.3 ns 
and with a phase of wot + 7 (independently controllable) at the center 
frequency of the channel. This interpretation is represented in Fig. 2 
by a phasor diagram at w = 0, the center frequency of the channel. 
Varying the frequency, w, over the channel bandwidth (30 MHz) moves 
the angle of the interfering ray through an arc of about 60 degrees (27 
x 30 MHz X 6.3 ns = 7/3), centered at the position shown. This 
diagram is useful for understanding the fade simulation; it also provides 
an alternate means of describing the position of the notch. The notch 
position may be specified by its frequency, f,, or by ¢, the angle of the 
interfering path at the center of the channel. 


—20 Log |H(w)| 


1 
—20 Log (1—b) 
| 


CHANNEL 
BANDWIDTH 





—100 —50 0 50 100 150 200 
f—fo, MHz 


Fig. 1—Attenuation of channel model function, H(w) = a[1 — 6 exp(—j(w — wo)r)], for 
7=6.3 ns, a= 0.1, b = 0.7. 
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{H(oo)] 3 = 29 (15 MHz) 


f = —15 MHz 


Fig. 2—-Phasor diagram of a modeled fade, ¢ = 45°, b = 0.7. 


From Figs. 1 and 2, or eq. (1), it may be seen that varying a changes 
the overall level and varying b changes the shapeliness of the modeled 
fade. Furthermore, if the minimum is within the 30-MHz channel 
bandwidth (|¢|< 30°), the fixed delay model can generate notches 
with a wide range of levels and notch widths. With the minimum out 
of band, it can generate a wide range of combinations of levels, slopes, 
and curvatures within the channel bandwidth. 


2.2 Model statistics 


The statistics of the model parameters were obtained from a selected 
data base during which heavy fading activity was observed.® The 
distribution of b is best described in terms of B = —20 log(1 — 6). 
Figure 3 shows the distribution of B and the least-squares straight line 
fit to the distribution over the region where it best represents selective 
fading—between B values of 3 and 23 dB. The channel is described by 
B greater than 23 dB for less than 0.15 percent of the observed time 
which makes the distribution less certain beyond this point. At the 
other extreme, during the periods of time when there is little or no 
selective fading, the channel is characterized by values of B less than 
3 dB. Thus, the fitted line represents a lower bound on the distribution 
for B less than 3 dB. Since the fitted line has an intercept of 5400 
seconds, we may model the fraction of 5400 seconds during which B 
exceeded a value X by the probability distribution 


P(B > X) =e **, (4) 

Thus the probability of finding a value of B between X and X + dX is 
aX 

pB(X) dX = 38 er. (5) 


The distribution of a is lognormal with a standard deviation of 5 dB 
and a mean that is dependent on B (or b). Hence, the probability that 
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_—— FIT: 5400 e~*/3 8 


SECONDS DURING WHICH B EXCEEDED ABSCISSA 





0 5 10 15 20 25 30 35 
X IN DECIBELS 


Fig. 3—Distribution of B for model data base period. 


A = —20 log a has a value between Y and Y + dY is given by 
adY 5 
pa(Y) dY = —— e bY 401/50, 6) 
SV20 


The relationship between Ao, the mean of the distribution, and B is 
given in Fig. 4. 

The distribution of f, is found to be independent of A and B. It is 
usually simpler to work with ¢ rather than f,. The two variables are 
simply related in that ¢ is defined on the interval (—7, 7) and a 2.5- 
degree change in ¢ corresponds to a 1.1-MHz change in f,. For the 
fixed delay model, the variable ¢ has been found to have a probability 
density function that can be described as uniform at two levels, with 
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DECIBELS 


STANDARD DEVIATION 
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0 5 10 15 20 25 30 
= —20 Log (1—b) IN DECIBELS 


Fig. 4—Mean and standard deviation of the distribution of A = —20 log a as a 
function of B. 


values less than 7/2 being five times more likely than values greater 
than 7/2. Thus, we have the probability density function per degree 
as: 


Ps(o) = (7) 
1 ° 


The functions in eqs. (5) to (7) can be used to determine the 
probability of finding a, 6, and f, in some region of a-b-f, space. This 
probability can be converted to number of seconds in the observation 
period by multiplying by 5400 seconds. To convert this probability to 
the number of seconds in a month requires scaling the data base. The 
scaling may be obtained from Fig. 5 which shows, for several frequen- 
cies in the band, the time during the model data base period that the 
channel was faded below a given level. Distributions are shown for 
average power in the band and for power at selected frequencies at the 
center and near the edges of the radio channel. (Frequencies indicated 
are at IF where the center frequency is at 70 MHz.) For the path used, 
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FADE DEPTH IN DECIBELS 


Fig. 5—Amplitude distributions for model data base period. 


one expects the received voltage (single frequency) relative to midday 
average to be less than L for 72.5 X 10* L” seconds in a month.’ For the 
data base used, the fading is best described by 48 x 10‘ L” hence, the 
data represent *% of a fading month. To obtain outage on a seconds- 
per-heavy-fading-month basis, the probabilities calculated with eqs. 
(5) to (7) must be multiplied by 5400 x 1.5 or 8100 seconds. 


2.3 Outage estimation 


The fixed delay model described above can be simulated with an 
equivalent circuit laboratory measurement to determine the equip- 
ment response to multipath fading. Conceptually, one determines 
critical values of A and B for which a specified error rate is achieved 
for each fade notch position. In practice, it is difficult to maintain a 
constant BER; it is more expedient to fix 6 and vary the carrier-to-noise 
ratio (a) while plotting the BER. From the resulting curves, one may 
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compute critical contours of A and B for each prescribed notch location 
and BER. Using eqs. (5) and (6), the probability that A and B lie on the 
high error rate side of a given critical contour may be calculated. 

By repeating this calculation for a uniform set of notch positions 
and using (7) to determine the probability weighting given to each and 
summing, one may estimate the probability of all selective fades that 
produce a BER exceeding the prescribed one. Multiplying this proba- 
bility by 5400 gives the outage time expected over the data base period; 
multiplying by 8100 gives the expected outage time per heavy fading 
month. 

The following section describes the laboratory measurement; Sec- 
tion IV describes the reduction of the measured curves and parameters 
to outage times. 


ll. LABORATORY MEASUREMENTS 


Figure 6 illustrates stressing of a digital radio system by means of an 
1F fade simulator. The simulator, which is inserted after linear IF 
preamplification but before any high-gain amplification, shapes both 
the desired signal and the effective received noise. It is necessary to 
operate the simulator at a sufficiently large input carrier-to-noise ratio 
that the concomitantly shaped noise at its output remains a negligible 
contributor to degraded system performance throughout the operating 
range of interest. 

Within its restricted frequency range of operation, the IF simulator 
is adjusted to achieve those specific shapes implied by Fig. 2. Although 
the measurements could have been made using an RF fade simulator, 
the choice of an IF simulator was based primarily upon considerations 
of signal and noise levels, and the repeatability of adjustments. The 
following section describes an IF shape-stressing measurement in the 
minimum detail necessary to qualify the data collected. 


3.1 Representative IF two-path fade stressing measurement 


The block diagram of Fig. 7 illustrates an arrangement employing 
an IF fade simulator and an IF flat noise source. A pseudo-random test 


RF TRANSMITTER, IF FADE 
MODULATOR SIMULATOR 
DATA PATTERN BER TEST 
GENERATOR RECEIVER 
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Fig. 6—1F fade stressing. 


pattern modulates the 6-GHz radio transmitter whose output is nom- 
inally 5 watts (@, in Fig. 7). The output spectrum is usually shaped by 
a bandpass filter following the microwave power amplifier to comply 
with FCC regulations. 

To enable back-to-back operation of the transmitter and receiver of 
a single repeater which normally operate on different radio channels, 
a radio test translator was employed. The translator output power was 
approximately —30 dBm (adjustable, at () to simulate the unfaded 
received signal level (RSL) observed typically in the field. 

Assuming a linear RF-IF conversion gain of 20 dB, the signal power 
at the input (©) to the 1F fade simulator is -10 dBm. The simulator 
incorporates low-noise linear amplification. A reference insertion loss 
for the main unfaded ray is 10 dB, including the output power summer. 
Hence the maximum desired signal power at the input to the main IF 
amplifier (©) is —20 dBm. 

Assuming a 30-MHz receiver noise bandwidth and a current-art 
receiving system noise figure of 5 dB, the total system noise power is 
approximately —95 dBm, referred to the receiver’s input port. This 
results in a flat receiver noise contribution of —85 dBm at input © to 
the main IF amplifier. Consequently, the maximum attainable carrier 
to simulator-shaped RF noise ratio is 10 log (Co/N,;) = —20 — (—85) = 
65 dB. The noise contributed by the fade simulator amplifiers must 
not exceed —100 dBm, to be negligible. 

Flat IF noise much larger than the unwanted and shaped system 
noise is added artificially at © and is adjusted in magnitude by a 
calibrated attenuator @ to superpose thermal noise degradations upon 
the simulated selective fading degradations of the desired signal. One 
would ideally measure the added IF noise power in the final predetec- 
tion bandwidth of the digital radio system, or twice the Nyquist 
bandwidth. It is more convenient in the laboratory to reference carrier- 
to-noise ratios to the output of the main IF amplifier by using the 
precalibrated acc voltage (assuming that wideband acc detection is 
employed), to measure both the unshaped signal and noise powers. 
The carrier-to-noise ratio at the detector would be higher—by the 
ratio of the system noise bandwidths that would be measured at the 
respective points. 

The noise source output in Fig. 7 may be adjusted so that an 
attenuator setting of 0 dB @ results in a noise power equivalent to 
that of the unfaded signal power (the attenuator is then calibrated 
directly in uncorrected Co/N,, in dB). As the 1F fade simulator is 
readjusted to achieve different prescribed fade shapes, its mean inser- 
tion loss may also change. The change in insertion loss is determined 
by monitoring the change in signal power at ©; the same loss increment 
(dB) must be added to the noise attenuator @) to reestablish the 0 dB 
reference. 
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Fig. 7—Laboratory test arrangement. 
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3.2 IF two-path fade simulator 


Figure 8 illustrates splitting the desired IF signal into an arbitrarily 
phased, adjustable “main” component and a “delayed” component 
fixed in delay (7 ns) but adjustable in magnitude. The main component 
is further resolved into orthogonal components (inset to Fig. 8) using 
wideband networks exhibiting flat gain and well-behaved delay. A 
particular sum vector is constructed by adjusting the orthogonal 
components to establish a simulated fade notch frequency; in practice, 
the phase sense of 0- and 90-degree components are independently 
reversible, as indicated by the switches in the figure, for complete 
flexibility in notch frequency selection. 

The 6.3-ns fixed delay added to the delayed path imparts a phase 
shift of 159 degrees at the 70-MHz 1F center frequency. This is shown 
built out to 225 degrees, relative to the 0-degree transmission path, 
using a 66-degree wideband network of the same type. The delayed 
vector is fixed in direction opposite the midrange position of the 
adjustable main vector, corresponding to a channel-centered fade (¢ 
= 0 degrees). 

Since 1/7 = 158.4 MHz, a change of 1 degree in direction of the main 
vector corresponds to a frequency displacement of the fade notch 
location of 0.44 MHz. For @ = —45 degrees, the notch location is 
displaced 19.8 MHz below the channel center (f, = —19.8 MHz). The 
magnitude of the delayed component is then adjusted to achieve the 
desired notch depth. 
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Fig. 8—1F fade simulator—conceptual block diagram. 
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3.3 Digital radio performance stressed by in-band selectivity and thermal 
noise 

The radio equipment was measured at uniformly spaced notch 
frequencies separated by 4.4 MHz (A¢ = 10 degrees). To fully charac- 
terize a period of variation in ¢, or f,, one would need to make 36 sets 
of measurements. Ideally, half may be omitted because of symmetry. 
For given values of A and B, the same error rate should obtain for a 
notch at a given frequency displacement above or below the channel 
band center. Variations in B ought not to have a significant effect for 
|| greater than 90 degrees. It was determined that detailed measure- 
ments were required for nine different values of f, to characterize the 
digital radio tested. 

Using a wideband RF fade simulator in the field, the digital radio 
performance for out-of-channel notch locations was relatively inde- 
pendent of whether minimum or nonminimum-phase fade simulations 
were employed. The nonminimum phase fade is modeled by eq. (1) 
with the sign of the phase term reversed. This leaves the amplitude 
feq. (2)] unchanged, but reverses the sign of the envelope delay 
distortion [eq. (3)]. We conclude that the minimum phase channel 
model is sufficiently general for use in simulating the channel and in 
estimating performance. 

The 1F fade simulator was adjusted for each notch frequency, and 
the depth of notch was varied by adjusting the magnitude of the 
delayed component. Then various amounts of IF thermal noise were 
added. Figure 9 typifies the performance data collected. BERs are 
plotted versus the uncorrected IF carrier-to-noise ratio (C/Ni,), for a 
constant fade notch offset from midchannel (f, = —19.8 MHz, + = 6.3 
ns). Each curve corresponds to a different notch depth (B = —20 
log(1—b) dB), and hence a different amplitude and delay shape in the 
radio channel. Each curve is also identified with an in-band selectivity, 
defined as the difference between the maximum and minimum atten- 
uation present in the (25.3-MHz) channel bandwidth. The lower-left 
“baseline” curve presents the unshaped signal, flat fading performance 
obtained by adding only IF thermal noise. This curve was verified 
(without the added IF noise) by attenuating the received RF input 
signal in the back-to-back configuration. 

Consideration was given to matching the order of measurements to 
characteristics of the particular digital radio tested. For example, 
considerable scattering of data at low error rates can result from 
synchronizations involving different reference carrier phases. The au- 
thors elected to perform several synchronizations while observing the 
BER for each phase, and then chose that phase giving the worst 
performance.* Synchronization was accomplished at the low error rate 


* Because the phase information in the measured system was Gray coded and digital 
access was on a per-rail basis, one rail had twice the BER of the other two. All 
measurements in the field and in the laboratory were made on this worst-case rail. 
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ie 9—High-speed digital radio 1F dispersive fade simulations, 7 = 6.3 ns, fo = —19.8 
z. 


(bottom) of each curve, and this phase relationship was maintained 
for all data points obtained for each curve. 

From the baseline curve of Fig. 9, a BER = 1 X 10°° obtains for 10 
log(C/Nif) = 21.5 dB. For the digital radio system installed on the 
instrumented hop and reported in the figure, the measured flat fade 
margin for a threshold BER = 1 X 10° was 40.5 dB This leads to an 
unfaded IF carrier-to-noise ratio 10 log(Co/Ni) = 21.5 + 40.5 = 62 dB. 

From the baseline curve for a BER = 1 X 10~’, note that insertion of 
a fade whose notch depth 1s 6.5 dB results in four orders of magnitude 
degradation in BER performance; equivalently, an in-band selectivity 
of only 5.7 dB in 25.3 MHz results in a BER > 1 x 107°. 

The asymptotic regions in Fig. 9, corresponding to high values of C/ 
Ni, are not normally presented in characterizations of this type; 
however, system outage depends primarily upon the performance in 
these asymptotic regions. Thus, under typical fading conditions, the 
transmitted carrier power might be increased at will without improving 
the BER significantly. The effects of decreasing the carrier power are 
discussed in Section 4.4. 

A family of curves like those shown in Fig. 9 was obtained (but are 
not given here) for each of nine uniformly spaced frequency offsets 
below midchannel to characterize the digital radio system sufficiently 
for the prediction of outage. A number of spot checks were also made 
using both RF and IF fade simulators at symmetrical positive and 
negative offset frequencies, to establish that acceptable symmetry 
existed. 


DIGITAL RADIO OUTAGE DUE TO FADING 1085 


IV. CALCULATION OF OUTAGE 


This section describes four methods of calculating outage. The 
derivation of the critical curves of A and B, which provide the basis 
for making and understanding these calculations, is given in Section 
4.1. In Section 4.2 the detailed calculation of outage from the critical 
A-B curves is described. It is shown in Section 4.3 that for the present 
system this method may be greatly simplified by calculating only 
selectivity-caused outage (i.e., neglecting thermal noise). Section 4.4 
presents an approximate method of accounting for the effects of 
thermal noise. Section 4.5 provides a basis for estimating the selectiv- 
ity-caused outage from a single measurement. 


4.1 Derivation of critical characteristics 


To calculate the outage for a fixed bit error rate, one must first 
obtain the critical curves of A and B at each simulated value of f,, the 
notch position. Thus, from Fig. 9 which corresponds to f, = —19.8 MHz 
(or ¢ = —45°), we obtain six points on the critical curve of A and B for 
a BER of 107°, one point from each of the six curves which cross the 
critical BER. The value of B is obtained from the value of b since 


= —20 log(1 — 0). (8) 


For the curve in Fig. 9 corresponding to B = 4.4 dB, we obtain the 
corresponding critical value of A for a BER of 107° from the value of 
carrier-to-noise ratio, which is 20.2 dB where this curve crosses the 
10-* BER line. Since the carrier-to-noise ratio is 62 dB when the channel 
is unfaded, the 20.2 dB value corresponds to a relative average power 
loss of 41.8 dB, 


Ls = 62 — 20.2 = 41.8 dB. (9) 


Without loss of generality, we assume that the PSK signal has a 
rectangular spectrum of width /,; consequently, the relative power 
transmitted by the model is obtained from eq. (2) as* 


1 athe 
ah 2 
Ps = nfs E | H(w)| dw 





- oft + b? — 2b cos Date wl =) \. (10) 


Tf eT 


*The calculated result is not critically dependent on the flatness of the signal 
spectrum or the spectral width chosen. We have used for f, a value of 25.3 MHz as 
representing the effective width of the signal. 
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Defining a correction term by 





C = -10 le + b? — 2b cos Dafoe = a (11) 
we may express the signal loss as 
Ls = -10 log Paw =A +C. (12) 
Thus, we obtain the critical value of A as 
A=L,-C. (13) 


For B = 4.4 dB (6 = 0.4) and f, = —19.8 MHz, we find C = 2.06 dB and 
the critical value of A is 41.8 — 2.1 = 39.7 dB. 

Carrying out these calculations for the six curves in Fig. 9, one can 
generate the critical curve of A and B for f, = —19.8 MHz and a BER 
of 10-*. The curve is shown in Fig. 10 along with the critical curves for 
several other values of the BER. A complete set of curves must be 
generated for all values of /.. 

The curves in Fig. 10 are typical of the critical curves obtained for 
| fo | = 33 MHz. The intercept with the A-axis represents the flat fade 
margin for the given BER; this margin is independent of notch position. 
The intercept of a critical contour with the B-axis represents the 
shape, or relative fade depth, margin for the given notch position. For 
values of B to the right of this intercept, the critical value of BER 
cannot be obtained at any carrier-to-noise ratio for the given notch 
position. 


4.2 Outage calculation— detailed method 


The probability, P., of finding A and B outside all critical contours 
may be written with eqs. (5), (6), and (7) as 


Po = | Po($)Pc(p) dg, (14) 


where 


P.(¢) -| | Pa(Y)pa(X) dY dX, (15) 
0 JAUx ; 


and A,(X) is the functional relation of the critical values of A to B (or 
X), for B less than B., the B-axis intercept, and for a given BER and ¢ 
value.* Since measurements were made for a uniformly spaced set of 
notch positions with spacing Ad = 10°, we may approximate (14) by 


* The dependence of the function A.(X) and the asymptote B. on BER and ¢ is not 
explicitly denoted to keep notation simple. 
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fy = -19.8 MHz 


20 Log (a) IN DECIBELS 


A=- 





0 5 10 15 20 25 30 
B = —20 Log (1—b) IN DECIBELS 


Fig. 10—Critical curves of A and B for f, = —19.8 MHz. 
P, = Ad Y poldi)Pc(di). (16) 
All 4; 


To illustrate the calculation of outage probability with eqs. (15) and 
(16), we shall calculate the term in the summation of eq. (16) corre- 
sponding to a BER of 10° and ¢; = —45° (or f, = —19.8 MHz). From 
Fig. 11, which is taken from Fig. 10, we note that the double integral 
in eq. (15) may be broken into integrations over two regions. Thus 


P.(¢i) -| | paA(Y)pp(X) dY dX (17) 
B, J-« 


Be fe 
+ i i PaA(Y)pa(X) dY dx, 
0 JAK 


where the two double integrals correspond to integrations over Regions 
1 and 2, respectively, in Fig. 11. Outage due to the occurrence of A and 
B in Region 1 may be described as outage due only to shape or 
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selectivity. In Region 2, outage is due to the combined effects of signal 
loss and selectivity. 

Using eqs. (5) and (6), the integral over Region 1 is obtained as 
e 8/38. The contribution due to thermal noise and shape (Region 2) is 
slightly more complicated. Dividing the interval 0 to B, in Fig. 11 into 
N subintervals, as shown in Fig. 12, the probability of being in Region 
2 is the sum of the probabilities for each subinterval. Thus eq. (17) 
becomes . 


P.(gi) = e778 + 5 [e-Fe-88 — eo 3x35) p, Pa), (18) 
k=1 


where 


foo} 


PAX) =—— | e*” dx. (19) 


Qa Jx 


REGION 2 
——-SELECTIVITY AND THERMAL 
NOISE LIMITED 


BER = 1073 


REGION 1 
SELECTIVITY LIMITED 


A= —20 Log (a) IN DECIBELS 





0 5 Be 10 15 20 25 30 
B = —20 Log (1—b) IN DECIBELS 


Fig. 11—Classification of outage with respect to critical curve for BER = 107°, fy = 
19.8 MHz. 
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20 Log (a) IN DECIBELS 


Az— 





O° Bai B.S Be 10 15 20 25 30 
B = —20 Log (1—b) IN DECIBELS 


Fig. 12—Outage calculation for an incremental interval. 


Evaluating the two components of eq. (18) from Fig. 11, we find 
P.(—45°) = 0.181 + 0.003 = 0.184. (20) 


This calculation was performed for 10 values of ¢; from —5 to —85 
degrees in 10-degree steps. Using these results in eq. (16) and multi- 
plying by two to account for positive values of ¢; which are assumed to 
contribute equally, we find the probability, P, for a BER of 10™° as 


P, = 0.0996. 
The expected outage for the data base period is, then, 
T. = 5400 x 0.0996 = 538 seconds. (21) 


4.3 Outage calculation— selectivity only 


It is apparent from eq. (20) that most of the outage for the system 
under study is caused by selectivity, fades characterized by A and B 
values in Region 1. From eqs. (14) and (17), we may express P,;, the 
probability of outage due to selectivity, as 


1090 THE BELL SYSTEM TECHNICAL JOURNAL, MAY-JUNE 1979 


Pos = i i Ps()pa(X) dX do. (22) 


For the system studied for a BER of 10~°, a finite B. is obtained only 
for | ¢:| < 90 degrees. Hence, we may use eq. (7) to simplify (22)* 
_ 2ho x —BAb)/3.8 
Pos = a6 & e : (23) 
From eq. (23) we see that the outage due only to selectivity depends 
on the relationship between B., the asymptote of critical B values, and 
the notch angle or notch frequency. Figure 13 shows the relationship 
between B. and the notch frequency for four values of BER. It is 
apparent from eq. (22) that the outage probability is the probability of 
finding B and f, values in the region above this curve. Such curves, 
therefore, provide a useful basis for evaluating the selectivity outage 
of a digital radio system. 


4.4 Outage calculation— approximate method 


For a radio system sensitive to both thermal noise and selectivity, 
the calculation of Section 4.3 is inadequate and that of Section 4.2 is 
unduly cumbersome. 

To illustrate a simpler, but generally applicable, method and at the 
same time to provide a useful incidental result, let us evaluate the 
effect of reducing the transmitted power by 10 dB. For the reduced 
power system, the carrier-to-noise ratio would be 52 dB for the unfaded 
channel, and the critical curves of A and B would be shifted by 10 dB. 
Figure 14 shows the critical curve of A and B for a 10-° BER and f, = 
—19.8 MHz with an overplot of the conditional mean of the distribution 
of A. The dotted curves on Fig. 14 represent 2-sigma intervals on 
either side of the mean. From the properties of the Gaussian distri- 
bution, one may determine that more than 95 percent of the values of 
A and B will lie between these two dotted curves. We designate as A, 
and B,, the coordinates of the intersection of two curves: the critical A 
— B curve and the conditional mean curve. Then approximating the 
critical curve of A and B with a straight line segment tangent at (An, 
B,,), with slope s, we may approximate the probability of outage by 
integrating the probabilities over the region to the right of the tangent 
line. Using eqs. (15) and (16), we obtain 


Po=Agd ¥ pated | | PA(Y)pa(X)dY dx. (24) 
A 0 JA,,+s(B-B,,) 


Il ¢; 


* The factor of two is required in eq. (23) because the indicated summation corre- 
sponds to an integration only over negative notch frequencies (¢; < 0). 


DIGITAL RADIO OUTAGE DUE TO FADING 1091 


RELATIVE NOTCH DEPTH, Be, IN DECIBELS 
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Fig. 13—Asymptotic performance curves. Locus of values of f, and B that produce a 
fixed BER at high carrier-to-noise ratio. 


40 
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B = —20 Log (1—b) IN DECIBELS 


Fig. 14—Approximate outage calculation for 10 dB less transmitted signal. 
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Interchanging the order of integration and ignoring* the B dependence 
of Ao(B), this becomes 


P, = Ae y p(oi) e Bm/3.8 90.866/s" (25) 
All 4; 


Evaluating eq. (25) for a 10-° BER and multiplying the result by 5400 
gives an outage estimate for the data base period of 602 seconds. 
Recalculating the total outage time at a 107° BER using the method of 
Section 4.2 [eqs. (16) and (18)] gives 636 seconds, which verifies the 
accuracy of the approximate method. The estimate of 636 seconds was 
calculated as an upper bound; the 602 seconds calculated using (25) 
tend to be a lower bound. We conclude that backing off transmitted 
power by 10 dB would increase the outage by about 12 percent (538 to 
602). 


4.5 A further simplification 


In this section, we show that the outage due to selectivity can be 
estimated approximately for a given BER from a determination of the 
in-band selectivity required (with the notch out-of-band) to produce 
that BER. Such a measurement may provide a useful approximation 
for any digital system using quadrature modulation components;? 
however, we provide a justification based on the performance of the 
system at hand. In-band selectivity is defined as the difference between 
the maximum and minimum attenuation present in the (25.3-MHz) 
channel bandwidth. 

Since the in-band selectivity is a constant for any of the curves 
shown in Fig. 9, one can use Fig. 9 to plot the asymptotic BER against 
in-band selectivity for f, = —19.8 MHz. Such a plot was generated for 
each notch position measured to produce the family of curves shown 
in Fig. 15. Note that, except for notch positions near the band center, 
the BER is uniquely related to the in-band selectivity. Neglecting the 
in-band notches, we find that a 107° BER corresponds to an in-band 
selectivity of 5.5 dB. 

If we use the original model of eq. (2) to determine the values of B 
that will produce an in-band selectivity of 5.5 dB for a number of 
different notch positions, we would generate Fig. 16. It is apparent that 
for this system there is a good correspondence between the curves of 
asymptotic performance (Fig. 13) and the curves of constant in-band 
selectivity (Fig. 16). 

To reinforce this conclusion, we provide Figs. 17, 18, and 19. Figure 
17 shows the locus of in-band selectivity in a 25.3-MHz band corre- 


* Including the effect of the slope of Ao(B) at B = B,, gives the same symbolic result 
with s interpreted as the algebraic sum of the slope of the tangent and dAo/dB evaluated 
at B= Bn. 
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Fig. 15—Measured asymptotic bit error rate vs peak-to-peak amplitude difference in 
a 25.3-MHz band. 
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Fig. 16—Locus of B and f, for modeled fades that have fixed peak-to-peak amplitude 
in a 25.3-MHz band. 


sponding to each of the curves of constant BER in Fig. 13. That is, for 
each BER and each value of notch position, f,, we have plotted the 
peak-to-peak amplitude difference in the band for the corresponding 
value of B., the asymptotic critical value of B. Figure 18 shows a 
similar set of curves with the peak-to-peak delay distortion in a 25.3- 
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IN—BAND SELECTIVITY IN DECIBELS 





f,, DISPLACEMENT OF NOTCH IN MEGAHERTZ 


_Fig. 17—In-band selectivity (in 25.3-MHz bandwidth) corresponding to asymptotic 
critical values of notch depth (B.) for several values of BER. 
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Fig. 18—Peak-to-peak envelope delay distortion in 25.3-MHz bandwidth correspond- 
ing to asymptotic critical values of notch depth (B.) for several values of BER. 


MHz band as the ordinate. Similarly, Fig. 19 has as the ordinate the 
“slope,” or amplitude difference at a separation of 25.3 MHz. It is again 
clear from these three figures that the in-band selectivity is the 
relevant channel impairment giving rise to errors. We see from Fig. 18 
that, for out-of-band notches, high BERs are obtained with very small 
values of peak-to-peak delay distortion, and from Fig. 19 that for in- 
band notches high BERs are obtained for very small values (zero at 
mid-band) of slope. 
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AMPLITUDE DIFFERENCE IN DECIBELS 
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Fig. 19—Amplitude difference at a 25.3-MHz frequency separation corresponding to 
asymptotic critical values of notch depth (B.) for several values of BER. 


The model data base was analyzed to determine the time during 
which the in-band selectivity in a band of 25.3 MHz exceeded a given 
value. Figure 20 presents this distribution for in-band selectivity as 
calculated from the modeled fits. Figure 20 can be used directly in 
conjunction with Fig. 15 to calculate the outage times for the model 
data base.* For instance, from Fig. 15 we note that 5.5 dB of selectivity 
corresponds to a 107° BER. We use Fig. 20 to determine that 5.5 dB 
was exceeded for 520 seconds. 


V. COMPARISONS OF CALCULATED AND OBSERVED OUTAGES 


Using the methods of Sections 4.2 to 4.5, outage times were calcu- 
lated for bit error rates of 10~° to 10° for both the model data base 
period and for a heavy fading month, by multiplying the outage 
probabilities by 5400 and 8100, respectively. 


5.1 Model data base period 


Calculated and observed} outages for the model data base period 
are shown in Table I. In general, comparing the calculated results with 
observed results, we see that the outage is underestimated at high 
BERS and overestimated at low BERs. Any estimation procedure based 
on the current modeled state of the channel will tend to underestimate 


* In practice, one would use a single measurement of in-band selectivity. For instance, 
in Fig. 9 one would take the 5.7-dB value corresponding to the curve asymptotic at a 
10° BER. 

+ Because of quantization, the outage times observed from the field experiment 
correspond to bit error rates of 1.26 x 107°, 1.57 x 10~*, 0.981 x 10°, and 1.19 x 10. 
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. Fig. 20—Distribution of in-band selectivity (25.3-MHz bandwidth) for model data 
ase. 


Table |—Outage in modeling data base period (seconds) 
BER= 10° 10% 10° 10° 


Observed 636 903 1191 1487 
Detailed calculation (Section 4.2) 538 960 1430 =: 1860 
Approximate calculation (Section 4.4) 527 950 1420 =: 1830 
Asymptotic calculation (Section 4.3) 527 950 1420 = 1830 
Selectivity calculation (Section 4.5) 510 900 1570 2730 


outage at high BERs because of hysteresis effects in the radio receiving 
equipment. That is, when the channel condition becomes sufficiently 
severe, the bit error rate becomes high enough (on the order of 107°) 
that the timing and/or phase of the radio system breaks lock. If the 
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channel impairment becomes less severe, the BER will not improve 
until the system resynchronizes. The hysteresis is important at the 
10-° BER, since a significant fraction of the events that cause 10™° BER 
will cause the system to break lock. 

One would expect to overestimate the outage at low BERs because 
of the method of taking data. Recall that, in measuring the curves in 
Fig. 9, it was found that the BER depended on the phase to which the 
system had locked. The recorded performance represented the worst- 
phase condition. At a 10~° BER, the best phase produces a BER that is 
about % that produced by the worst phase; the difference in BER from 
worst to best phase at a 107° BER is negligible. Hence, one would 
expect outage to be overestimated significantly at low bit error rates. 

In comparing the outage calculated from in-band selectivity (Section 
4.5) to the outage observed, we find that the overestimation of outage 
at low BERs is more severe than with the other methods. This is due 
to the greater sensitivity of the differential selectivity method to the 
bias induced by choosing the worst-case phase. For instance, compar- 
ing calculations at a 10°° BER, we find that Fig. 20 is steeper for 
amplitude differences near 2 dB than is Fig. 3 near B values of 3.5 dB. 
(Figure 9 verifies the appropriateness of this comparison). More gen- 
erally, one expects the method based on in-band selectivity to overes- 
timate the outage because the method is based on notches out of band. 
From Fig. 15, it is apparent that, for a given AA, some scans will not 
have the BER specified. 

We conclude that, although calculation of outage from sensitivity to 
in-band selectivity provides quick estimates, they are less accurate. 
The calculation requires knowledge of the distribution of in-band 
selectivity over a specified bandwidth. These statistics are neither 
simple nor generally available. It has been shown,” for instance, that 
slope statistics have a nontrivial dependence upon the measurement 
bandwidth. 

It is clear that the calculations based on selectivity (Sections 4.3 and 
4.5) agree for the system studied here because that system has very 
little outage due to thermal noise limitations, and because it is sensitive 
primarily to in-band amplitude excursions. The extent to which these 
statements are true for other systems is currently unknown. 


5.2 Outage on a monthly basis 


The results in Table I may be put on the basis of a heavy fading 
month by increasing them by a factor of 1.5, as discussed in Section 
2.2. The resulting outages (including the scaled observed outage) are 
compared with the outage observed in a one-month period® in Table 
II. We see that the outage times observed in the total one-month 
period agree well with the values obtained by scaling the observed 
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Table !l—Outage in a heavy fading month (seconds) 
BER= 10° 10% 10° 10° 


Observed (Ref. 8) 1000 1320 2100 2900 
Scaled observation from Table I 955 1350 1790 2230 
Calculation (Section 4.2-4.4) 800 14380 2140 2760 
Selectivity calculation (Section 4.5) 770 1350 2350 4100 


outage for the data base period used in modeling, except for the slight 
divergence appearing at low BERs. This divergence should not be 
unexpected for this equipment. As may be seen in Fig. 15, a 10°° BER 
may be caused by differential amplitude selectivity in band of 2 dB. 
Such modest amounts of selectivity may be expected to occur some- 
times in the presence of very moderate selective fading. The modeling 
data base was constructed by selecting only periods of significant 
selective fading. This reinforces the comments made in conjunction 
with Fig. 3, namely, that the model distribution of B represents a lower 
bound for small values of B which can contribute to outage at the 107° 
BER level. 


VI. CONCLUSIONS 


We have demonstrated the validity of a technique for estimating the 
unprotected outage of a digital radio system due to selective fading on 
a particular hop in the 6-GHz common carrier band. The technique 
required field measurements to statistically characterize the param- 
eters of a model of propagation on the hop. It also requires performance 
data obtained in the laboratory for the radio system by stressing it 
_ with a two-path fade simulator with a differential path delay of 6.3 ns, 
corresponding to the fixed delay channel model. Since the radio path 
on which these measurements were made has a length close to the 
average for the Bell System long haul radio network and has an 
average incidence of fading activity, the channel model is representa- 
tive of a typical path. At the very least, the technique provides a basis 
for determining the relative merits of various digital radio systems 
operating without benefit of space diversity. 

For the system under test, outage was calculated by four different 
methods. Because this system was selectivity-limited rather than 
noise-limited, all four methods predicted approximately the same 
outage as that summarized in Table I; however, the method based on 
in-band selectivity is more severely biased at low BERs. The method 
based on asymptotic performance and that based on in-band selectivity 
can only be used to estimate outage due to selectivity. If the transmit- 
ted power of the system under test were reduced by 10 dB, both of the 
other two methods, the detailed and the approximate method, would 
predict an increase in outage time of about 12 percent. 
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B.S.T.J. BRIEF 


A Counterexample to a Conjecture on the 
Blocking Probabilities of Linear Graphs 


by H.W. BERKOWITZ 


(Manuscript received November 21, 1978) 


It was conjectured by Chung and Hwang that a series-parallel 
regular linear graph is superior to another if its degree sequence 
majorizes the degree sequence of the other. A counterexample to this 
conjecture is given. 

The following definitions are taken from Refs. 1 and 2. Consider a t- 
stage linear graph with a source (the vertex of the first stage) and a 
sink (the vertex of the last stage). All the vertices are arranged in a 
sequence of stages such that, for each edge, one vertex is in stage 1 and 
the other vertex is in stage z + 1, for some 7. Each edge is in one of two 
states, busy or idle. A linear graph is blocked if every path joining the 
source and the sink contains a busy edge. Assume that any edge 
connecting a vertex in stage z with a vertex in stage z + 1 has probability 
pi of being busy for 1 = i = ¢t — 1. For a ¢-stage linear graph, the 
sequence (pi, Po, -+* , Pr-1) is called the link occupancies for that 
graph. One ¢-stage linear graph is superior to another if, for any given 
link occupancies, the blocking probability of the first graph does not 
exceed that of the second. 

Let e be an edge from a vertex a in stage 1 to a vertex b in stage 1 
+ 1. Define A(e) to be the ratio of the outdegree of a to the indegree 
of 6. A t-stage linear graph is regular if, for each i, 1 =i st -— 1, ife 
and f are any two edges between stage z and stage i + 1, than A(e) = 
A(f). In this case, let A; = A(e). Thus a regular linear graph is associated 
with a unique degree sequence (Aj, Az, «++ , Ari). 

A degree sequence (Ai, Ao, +++ , Aci) majorizes another degree 
sequence (Aj’, Ao’, »++ , Ar-i’) if and only if AyAz --- A; =Ar‘/Ag’ +--+ A,’ for 
every i, 1Sist-—1. 

A series-parallel regular linear graph is a regular linear graph which 
is either a series combination or a parallel combination of two smaller 
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series-parallel regular linear graphs with an edge being the smallest 
such graph. 


|. A COUNTEREXAMPLE 


In Ref. 2, Chung and Hwang conjectured that one series-parallel 
regular linear graph is superior to another if the degree sequence of 
the first majorizes the degree sequence of the second. 

The graphs of Fig. 1 are a counterexample to this conjecture. The 
degree sequence of graph (a) is (2, 1, 1, 1/2), the degree sequence of 
graph (b) is (2, 1/2, 2, 1/2). Thus, the degree sequence of graph (a) 
majorizes the degree sequence of graph (b). 


(a) 


(b) 


Fig. 1—Counterexample. 


Let (p1, D2, Ps, Ps) be the link occupancies of the two graphs. Let gq; 
=] — p, for 1 <i <4. Then the blocking probability of graph (a) is A 
= (1 — qigeqsqs)”. The blocking probability of graph (b) is B = (p, + 
P2 — Pip2)” + (ps + Ps — paps)” — (pi + po — pip2)”*(ps + ps — Pspa)”. 
Now let p; = 0.1, for 1< i< 4. Then A = (1 — (0.9)*)? = (1 — 0.6551)” 
> (1 — 0.7)? = 0.09. But B = 2(0.19)” — (0.19)* < 2(0.2)? = 0.08. Thus 
for this set of p,’s, the blocking probability of graph (b) is less than 
that of graph (a); so graph (a) is not superior to graph (b), contradicting 
the conjecture. 
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